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(57) ABSTRACT 

A translation management system in a computer environ- 
ment. A preferred embodiment of the invention automati- 
cally detects when a document, data stream, or non-text file 
in the master language has been updated and notifies the user 
which corresponding documents, data streams, or non-text 
files in the other languages require translation which are then 
staged and dynamically routed and sequenced to individual 
translation resources where the actual translation is per- 
formed. Management status, reporting, scheduling, and 
accounting information is sent to the user as the translation 
process ensues. The user is notified of the completion of 
translation and the invention coordinates the delivery of the 
translated documents, data streams, or non-text files back to 
the user's site for installation and optional review. 

24 Claims, 12 Drawing Sheets 
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TRANSLATION MANAGEMENT SYSTEM 

This application claims benefit of Provisional Applica- 
tion No. 60/075,740, filed Feb. 23, 1998. 

BACKGROUND OF THE INVENTION 

1. Technical Field 

The invention relates to the multilingual translation of 
documents in a computer environment. More particularly, 
the invention relates to the management of monolingual 
documents, data streams, non-text files, and databases to 
enable their translation into target languages in a computer 
environment. 

2. Description of the Prior Art 

The surge of growth in Internet access across the world 
has created the need for multilingual Web sites. The majority 
of Web sites are created in English which creates a problem 
for non-English speaking Internet users. This also creates a 
problem for Web site managers that maintain Web sites that 
are, or attempt to be, multilingual. The Web site content 
must be constantly maintained to be up to date and synchro- 
nized in all of the languages supported by the Web site. 

One approach is shown in PCT International Publication 
Number W097/18516 of Flanagan et al., published on May 
22, 1997. This application integrates a machine translator in 
the end user's Web browser. The user selects the desired 
target language and the Web document retrieved by the 
browser is then translated by the machine translator. 
Unfortunately, errors occur in the translation and the appro- 
priate content is not consistently displayed to the user. 

An alternative implementation in the above application 
places pre -translated Web pages on a Web server. The Web 
server stores all of the translated Web pages locally. The 
Web site visitor sends a request to the Web server for a page 
in a specific language. The Web server searches for the 
pre-translated page that has been stored locally and sends it 
to the Web visitor's Web browser. This creates a storage and 
file management problem on the Web server because of the 
duplication of each entire page of the Web site. 

Some search engines on the Internet offer an option to 
translate the hits that come back from a search query into 
different languages. The user initiates the translation through 
a menu selection. The translation is performed automatically 
by machine. 

Japanese Pat. No. 9-114852 is a method for a search unit 
which takes a search string in a second foreign language and 
translates it into a first foreign language. The translated 
search string is used to search the original document which 
is in the first foreign language. 

Japanese Pat. No. 9-44502 is a method wherein a docu- 
ment in a first foreign language is translated into a second 
foreign language. The two documents are then displayed 
separately or together through a user interface. 

European Pat. No. 0376741 Bl issued to Francisco on 
Oct. 18, 1995, is a method for displaying error messages on 
a document collating and envelope stuffing machine. Said 
error messages exist simultaneously in a plurality of lan- 
guages and are displayed in the language specified by the 
user. This approach has storage and maintenance problems 
because each error message must be duplicated in its entirety 
for each language. 

European Pat. Appl. No. EP0774722A2 of Microsoft 
Corporation published on May 21, 1997, is a method for an 
information retrieval system that separates the design and 
content components of a document page. Pages are created 
with controls that define areas for content to be inserted into 
said pages. 
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It would be advantageous to provide a translation man- 
agement system that gives the user local control of the 
multilingual translation of electronic content. It would fur- 
ther be advantageous to provide a translation management 
5 system that allows the user to easily manage, schedule, and 
track translation resources and the content that are sent to the 
translation resources, thereby enhancing the maintainability 
of the system as a whole. 

SUMMARY OF THE INVENTION 

The invention provides a translation management system. 
The invention utilizes an intuitive user interface for man- 
aging document translation for multilingual Internet Web 
sites, documents, data streams, and non-text files, enabling 

15 the user to incrementally update the language content of a 
Web site or document and enhancing the maintainability and 
storage of multilingual electronic content. 

A preferred embodiment of the invention automatically 
detects when a document, data stream, or non-text file in the 

20 master language has been updated and notifies the user 
which corresponding documents, data streams, or non-text 
files in the other languages require translation. The 
documents, data streams, or non-text files requiring trans- 
lation are staged and dynamically routed and sequenced to 

25 individual translation resources where the actual translation 
is performed. 

Management status, reporting, scheduling, and account- 
ing information is sent to the user as the translation process 
ensues. The user is notified of the completion of translation 

30 and the invention coordinates the delivery of the translated 
documents, data streams, or non-text files back to the user's 
site for installation and optional review. 

The invention makes a variety of translation resources 
instantly available to the user These translation resources 

35 include both automated translation tools as well as human 
translators. The translation resources are connected to the 
invention using a flexible architecture that can be deployed 
on intranets as well as the Internet. 

^ Other aspects and advantages of the invention will 
become apparent from the following detailed description in 
combination with the accompanying drawings, illustrating, 
by way of example, the principles of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

45 FIG. 1 is a block schematic diagram of the major com- 
ponents of a preferred embodiment of the invention accord- 
ing to the invention; 

FIG. 2 is a block schematic diagram of the placement of 
the Visitor module in a network environment according to 
50 the invention; 

FIG. 3 is a block schematic diagram of a decision flow 
diagram of the Visitor module according to the invention; 
FIG. 4 is a block schematic diagram of the major com- 
55 ponents of the Developer module according to the invention; 
FIG. 5 is a block schematic diagram of the placement of 
the Developer module in a network environment according 
to the invention; 

FIG. 6 is a block schematic diagram of the major com- 
60 ponents of the Workflow Manager module according to the 
invention; 

FIG. 7 is a block schematic diagram of a decision flow 
diagram of the Manager's Console module according to the 
invention; 

65 FIG. 8 is a block schematic diagram of the major com- 
ponents of the Manager's Console module according to the 
invention; 
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FIG. 9 is a block schematic diagram of the major com- accessed localized content is placed into a Cache 206. The 

ponents of the Adaptor segment of the Workflow Pipeline content is placed in the Cache 206 so that if a similar request 

module according to the invention; comes in for a document in that language and for that 

FIG. 10 is a block schematic diagram of a decision flow country, then the cached version will be pushed out to the 

diagram of the Adaptor module according to the invention; 5 browser 201. This saves time and processor overhead for 

FIG. 11 is a block schematic diagram of the major accessing the database and file system 204 to rebuild the 

components of the Adaptor module according to the inven- requested content. In addition, Visitor 202 informs the 

tion* and browser 201 of the proper font and content encoding needed 

... f . , . to display the selected language and enables the browser 201 

, ™- a * ock $am ^ t d ?Sn«n of the placement of lfl to ^ font ^ * f { B itstream's True- 

the Workflow Manager module in a network environment Dqc technology if ire ^ 

according to the invention. A m ,„ . ' , . n 

^ The Visitor module intercepts input text that is submitted 

DETAILED DESCRIPTION OF THE using an HTML form (e.g., a customer feedback form), and 

INVENTION writes it into a form database 205, in a manner so that it is 

15 easily translated later via the Workflow Manager. Most Web 
The invention is embodied in a translation management sites have forms that allow site visitors to submit comments 
system for computer applications. A system according to the or request more information from the site host. This feed- 
invention provides an intuitive user interface for managing back might not be in English and would otherwise be 
document content translation for multilingual Internet Web incomprehensible to the recipient in a multilingual environ- 
sites, documents, data streams, and non-text files, enabling 20 me nt. 

the user to incrementally update the language content of a ^ form ^ base 2 0S includes sufficient information to 

Web site or document and automatically initiate the trans- identify the language and encoding of the text to 

ation of the content into the corresponding target languages, properly interpret it for subsequent translation. Furthermore, 

keeping the multilingual content synchronized and enhanc- me ^y^^ mcludes a novel database viewer which allows 

ing its maintainability and storage. Many existing applica- 25 ^ contem to te m me of me form 

tion programs use approaches that make it difficult for the m which it was origmal i y enterea \ 

user to manage the translation and synchronization of mul- _ _ ^ , , , 

tilingual content Referring to FIG. 3, the Visitor module determines the 

„ ' . .... Web site visitor's language and country 301 from one or 

A preferred embodiment of the invention provides a more of me following criteria: 

comprehensive suite of three modules that allow a user to 30 , - . . L 

build, operate and maintain a multilingual Web site easily Unguagp prcfcrcncc a*** m thc browscr 

and efficiently. The invention enhances an existing Web site Cookie from previous visit to the Web site 

by adding advanced multilingual content management and Root domain of the visitor 

process control capabilities to the customer's Web server. It Manual selection by visitor 

also uses standard and open interfaces thus enabling the 35 u^age com^t available on the server 

invention to work with all Web servers on supported plat- r , , 4 . c . , t . . 

f OTms Local database of countries and languages of the world 

Each module is easy to set up and does not require special ^ « *» ™* ** 

modifications to the Web site The invention provides an ^ Vi "* r 1 m ? tak l f"^" 5 ~ n,en , t , U1 aU laDgU f geS and 

easy migration path from early-stage customers who are 40 "M*^ .mehdng JJmcodc |. Manual language select™ is 

, • . „ t , ■ _ . * ** * <c * easily implemented through the use of a special command 

only interested in measuring overseas visitor traffic to more . : * , . . . ... j , iL Y . . 

sophisticated customers who need to manage the rapid 861 * ? e ! cribed 'f r » doCUment trrus ehxnmahng the 

translation of changing content on large, attributed multi- need . io ! CG } S £P* ° r ™«?P«*»f P*8"- ™* 

lingual Web sites. required content for the Web site visitor's language and 

. 45 country is checked to see if has been previously requested 

Referring to FIG. 1, the invention provides three compo- and rcsidcnt ^ ^ cachc If ^ roiltcnt js m mc cache> 

nents. The Visitor module 101, the Developer module, and thcn it fa delivcred {l0m the 30 3. 

the Workflow Manager module 103. Each module accesses If me rcquired mnMi ^ not m thc cache> then ±c Wcb 

one or more language and country databases and file systems fc DOtified of me appropriate localized content 

104. The Visitor 101, Developer 102, and Workflow Man- 50 ^ quirtd 304. Th e appropriate content is then placed into the 

agerira modules can operate in the same environment or in cacbe for fatuI€ reference 305 . The Web site visitor's 

a standalone mode. country and language preferences are recorded in the server 

Hie Visitor module greatly enhances the multilingual Web i 0 g 306. The enhanced Web server log gives the site man- 
site visitor's experience by providing an automated and a ger a detailed breakdown of country and language for 
seamless way to serve content in the correct language. Web 55 visitors to the Web site. Finally, the browser is notified of the 
site publishers can serve all their multilingual content proper display font and allowed to download the font 307. 
through a single point of entry using the Visitor module. I nput text _t ha t is submitted using an HTML-form is 
Web site visitors will immediately understand the informa- intercepted 308. The text is converted to an internal format 
tion they see when they enter a site because it will be 309 and is placed in the fo rms datab ase 310 for later 
instantly presented in their language and for their country. 50 translation. ' 

With respect to FIG. 2, the Visitor module 202 works in The Developer module enables the Web site developer to 
tandem with a customer's existing Web server 203. It build a single 'master* site which is subsequently displayed 
automatically determines the language and country of a Web in any number of languages, compared with conventional 
site visitor and directs the Web server 203 to deliver the methods whereby the site is re-engineered for each lan- 
appropriate localized content contained in one or more 65 guage. 

country/language databases and/or file-based content in a Referring to FIG. 4, the Developer module contains all of 

file system 204 to the visitor's browser 201. Recently the features of the Visitor module 401 which records the 
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Web site visitor's country and language preferences in the embedded within Javascript code. The Javascript code is 

server log 404. embedded within the HTML stream that is sent back to the 

A toolkit 402 is provided which allows a master site to be server. The HTML "body" tag has an optional attribute 

built that is language and country-independent. The actual where Javascript code can be inserted after the tag. The 

language content is placed in a language and country data- 5 informational text contained within the Javascript code is in 

base and/or file system 403 where it is easily managed and english and enclosed within the appropriate system tags 

maintained When a visitor enters the site, the requested described below, enabling it to be replaced with its transla- 

document is automatically served in the visitor's language tion in the most recently valid language of the visitor. The 

and for the visitor's country by filling in a document informational text has been pretranslated into all of the 

template from the master site with the correct language and to languages in the system database for the customer. This 

country content from one or more databases. allows easy access to the appropriate translation for the 

This structure makes the site more compact, reduces site informational text based on the most recently valid language 

resource requirements and lowers the cost of operating the for the visitor's country and language, 

site. This allows a multilingual Web site to be built 

The cost of making a design change is also reduced as is incrementally, such that only a portion of the documents on 
only the master site needs to be updated. This is in contrast the site need be translated and documents not available in a 
to the more time-consuming and expensive approach of requested language will be served in the next most appro- 
propagating a single change throughout all language ver- priate language for a Web site visitor, 
sions of the site which is unnecessary with the invention. This also gives the customer the option of translating only 

Keeping the translated content in a database 403 also 20 a subset of the total content on the site. The Developer 

makes it extremely easy and cost-effective to add additional module controls all aspects of content navigation and deliv- 

languages to the site. Adding a language simply involves ery for the entire site so that a visitor will always be served 

translating the appropriate content and creating new entries in the most appropriate language, based on what language 

in the database 403. The Developer module makes it unnec- content is actually available on the server. The visitor will 

essary to replicate the entire site for each new language and 25 never see an error message (i.e. a "404 -document not 

country as would typically be the case. found") because the requested content is not available in the 

The toolkit 402 offers the multilingual Web designer a visitors language, 

great deal of flexibility. It allows a site to be built which Regions are mapped and organized according to the Web 

combines content common to all languages with content that site's needs. For example, a multi-user server can have a 

is specific to a particular language and country. For example, 30 separate region definition for each user's site. The regions 

company-wide information which needs to be served in all file organizes the regions, countries, and languages that the 

languages is easily combined with regional-specific infor- user is concerned with and is configured by the user. The 

mation that may only be required in a single language. user sets up a mapping, for example, one user can have 

The invention provides special tags that are used to insert information specific to Italy and another user can have 

language or country-specific content into an HTML docu- 35 information that is just concerned with the European region 

ment. The tags are: Multi-country server-side includes and is not concerned with Italy. This approach allows a 

(MCSSI); and Multi-language server-side includes hierarchical region-based lookup scheme. Content is hier- 

(MLSSI). MCSSI allows locale-specific elements of an archically stored as country and language independent ele- 

HTML document to be dynamically included as a function ments such that a one-to-many and many-to-one mapping 

of the current region or country, while MLSSI allows 40 exists between country and language. The advantage to this 

localized elements of an HTML document to be included as approach is that there is no content duplication; the user has 

a function of the current language. only as many files in the system as he has unique content. 

The Developer module also supports the incremental With respect to FIGS. 4 and 5, the toolkit 402 also features 

construction of a multilingual Web site through a feature that a novel mechanism to create localized content for specific 

makes it unnecessary to fully populate the site's language 45 geographic regions or countries by using a template-based 

matrix completely with documents. If a requested document approach to dynamically create documents tailored for a 

does not exist (i.e., it has not been translated into the specific language or country. This feature makes it easy to 

requested language), the invention automatically retrieves create a true global site localized for each area of the world 

the document in the next most appropriate language for the with the smallest achievable site footprint on the Web server 

visitor, or the default language of the document, as specified 50 503. 

by the Web site manager. Documents can exist on separate A template contains placeholders for country and 

servers and/or databases. language-specific information that has been removed from a 

The invention walks down a priority list of languages for document. This information is dynamically inserted from a 

a designated country. If a document in the language that is TermDB 508 (an external glossary), another template or 

first on the priority list is not present on the server, it then 55 document located in a database or file system 509, or 

looks for the next most appropriate language. The list is provided automatically by the D eveloper module 5Q 2_when 

sorted according to the most prevalent language or dialect the composite document is presented to the browser 501. 

for a particular country. For example, the languages entries The basic approach to creating a template has two steps, 

for Egypt are {arabic, french, english, berber}. A visitor First, remove as much country and language-specific infor- 

from Egypt will be presented with a requested document in 60 mation as possible from a document. Then, replace this 

Arabic, if it is available. If it is not, the system will look for information with appropriate tags and commands specific to 

one in French, and so on. the invention and described later in this document. 

When a visitor is presented with content in an alternate The advantage to using templates is that a single docu- 

language, an informational text message can be included ment can easily support many languages and countries. For 

telling the visitor that the preferred language was not avail- 65 example, a single form or CGI -generated document can be 

able. The informational text is supplied in the most recently constructed so that it will be automatically localized for 

valid language for the visitor's country and language and is different languages and locales. This significantly reduces 
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the number of documents that have to be maintained on the The Manager's Console is the user interface for the 
site and makes it very easy to add new languages. It also Workflow Manager and is the primary point of interaction 
allows a single update to a document to be immediately for the Web site manager. 

propagated to all languages and countries. Referring to FIG. 7, the Manager's Console detects when 

This technique allows country or regional content to be 5 a document in the master language has been updated 705. It 
conveniently separated from the structure of the document, mcn notifics tbe Wcb sitc manager which corresponding 
thereby making it easy to change a design often by updating documents in the other languages require translation 706. 
a single file. This in turn eliminates the ripple effect often Xhe ^ Console provides "one-touch" translation, 
seen when a single change must propagate tnrougfi all of the whereb me click of a J documents from any source 
localized documents on the site. This allows the Web Server _ • • * * ^ , . A , : c 

503 to control the locale-independent templates 504, 10 L e £ m ? g ^ £ * e ^ ° LTu J 

language-specific elements 505, country-specific elements 70 ?' sta g^ /own the Workflow Pipeline 704 then dynami- 
506, and static HTML content 507. ^ routed and sequenced to the individual Translation 

Visitors to a multilingual Web site may wish to enteryv\ Resources where the actual translation is performed. Docu- 
information and feed back i mo_a fill-out form i n their locaT | men ^ ma y be WeD based ^ d non-Web based and may also 

contain non-text elements. The documents may contain or 
refer to additional dependent components such as graphics, 
audio, video, and other multi-media elements. 

When the Web site manager initiates the translation 
process, the newly updated master language document and 
20 its constituent elements, together with associated control 



lang ggge~to send back to the W&b site mana ger. A mecha- 
nism is provided, and described above, which facilitates the 
acquisition and translation of this information. The Devel- 
oper module captures and tags all such information in a 
sufficient manner so it can be subsequently translated and 
analyzed by the customer. ■ 

This approach allows a site to be built with minimal effort information will be converted by the Console to an internal 

and cost, and allows the site to grow over time to meet the format, one which is more suitable for information transport 

exact needs of the customer. over the Workflow Pipeline 703. 

Referring to FIG. 6, the Workflow Manager module The Manager's Console automatically controls the 

enables the Web site manager to orchestrate the creation of 25 sequencing and selection of Translation Resources during 

foreign language versions of the site and to automatically workflo w processing according to subject matter of the 

keep them synchronized and up-tondate, cost-effectively and document to be processed, target language of the translation, 

with minimal effort. The Workflow Manager module pro- quality level ( whether draft-only or high quality is required) 

yides a Manager's Console 601, access to a Workflow and other variables . ^ Web site manager can also indi . 

Pipeline 602, and customizable Translation Resources 603. 30 yi duaUy specify the use of a specific set of Translation 

Translation queues 604 hold the incoming translated docu- Resources 

ments and outgoing documents to be translated. Documents, ^ Manager > s Console provides management status and 
data streams, and non-text content are stored in one or more reporting as the translation process ensues 707. It then 
language and country databases and/or file systems 605. automatically notifies the Website manager of the document 
Data streams and non-text content are considered documents 35 1im ^ Maa completion and coordinates the delivery of the 
in toe following text translated documents back to the Web site for installation 
A brief summary of the duties of the three primary and optional review 7(W. The Web site manager can directly 
components are: j^j! thc translated documents back onto the active area of 
the Manager's Console, which serves as the user interface me Web site al the click of a button, or can make them 
the Workflow Pipeline, or transport layer 40 available for internal review within the organization, 
a set of Translation Resources that perform the actual Status information is presented through a highly usable 
translation work interface that facilitates interaction and improves the pro- 
file Workflow Manager module is the core component of ductivity of the Web site manager. The Console interface is 
the invention. The module allows the Web site manager, designed to allow translation and update of the multilingual 
with no prior linguistic experience, to effortlessly manage 45 Web site to be performed cost-effectively and with minimal 
the traditionally complex process of translating and updating effort by a non-specialist. 

a multilingual Web site. New languages are easily added to the site as well. The 

The Workflow Manager module provides a natural migra- Web site manager simply selects the new language from a 

tion path beyond the visitor and Developer modules. The pulldown list 701, and the Manager's Console automatically 

Visitor and Developer modules provide a solution to efli- 50 initiates the translation of documents into the desired 

ciently serve and organize content on a multilingual Web language, under the control of the Web site manager 702. 

site. The Workflow Manager completes the product family The documents are converted to the internal format used by 

by enabling the language content on the site to be kept the invention 703 and staged down the Workflow Pipeline 

up-to-date, efficiently and cost-effectively. 704. 

The Web's very nature is dynamic. A monolingual Web 55 The Manager's Console also has a built-in access and 

site must be updated frequently to remain current. A multi- version control system which allows it to be easily inte- 

lingual Web site has an even more demanding problem of grated with a third-party authoring or document manage- 

requiring all languages to be updated and synchronized ment system on the Web site. 

simultaneously. Using traditional techniques, the cost and With respect to FIG. 8, a task view of the Manager's 

time required to perform this update and synchronization 60 Console is shown. The Update Status module 801 updates 

exceed the benefit and lifetime of the newly updated infor- the Web site manager of the document translation status. The 

mation to be served, effectively rendering the Web site Complete Document module 803 retrieves the translated 

useless. Tbe Workflow Manager provides a solution to this document and its constituent or dependent components from 

paradox by reducing the cost and time requirement for the Translation queues 804 and updates the associated 

language update and synchronization dramatically, thereby 65 document status information. The document delivery to the 

ensuring that information on the Web site is always current, Web site manager is then coordinated by the Coordinate 

regardless of language. Document Delivery module 802. 
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Documents that need to be translated are extracted from erator for the appropriate Translation Resource 904. Each 

the language and country databases and file systems 808 and Adaptor accepts the internal format document, sends the 

are sent to Translation Queues 804. The Project Analyzer document to the appropriate Generator 901 which converts 

module 806 receives project components, in the form of the format to one that is acceptable to the Translation 

documents and other electronic content from the Translation 5 Resource 904 attached to the Adaptor 902. The Generator 

Queues 804 and analyzes the project in sufficient detail to 90 * me newlv formatted document to the Adaptor 902 

determine project cost and resource requirements. which in turn sends it to the Translation Resource 904. When 

The Project Analyzer module 806 waits until all elements me document has been translated by the Translation 

of a project have been received. It then sends the Scheduler Resource 904, the Adaptor 902 accepts the document and its 

module 805 a description of the project and instructs the to associated ^information and invokes a Parser 903 which 
Scheduler module 805 to begin the project Typically, the 

Manager module 809 to generate a project cost estimate extracted from ^ Translation aod ^ down tfae 

prior to beginning me project Pipelme. m Parser converts the document mto me internal 

Once the Scheduler module 805 receives the project 15 format. Generators are, m a similar manner, invoked when- 

information from the Project Analyzer module 806, it ere- ever a translated document is posted back onto the Trans- 

ates Work Packets, Job Tickets and a Project Schedule. A lation queues after completion of translation. The Generator 

single project consists of a set of Work Packets and associ- converts the internal format document back to the docu- 

ated Job Tickets. Each Job Ticket contains a reference to a ment's original format (e.g., HTML). The document is then 

Translation Resource. A set of Job Tickets is associated with 20 sent to the final destination. 

a single Work Packet and describes the activities to be With respect to FIG. 10, the Adaptor converts work 
performed by that Work Packet. The Scheduler module 805 packets to the appropriate Translation Resource format 
generates a Schedule that is sent to the Manager's Console 1001. The packets are then routed to the appropriate Trans- 
fer review and validation. The documents and their con- lation Resource 1002. 

stituent or dependent components are then scheduled and 25 ^ Adaptor also receives packets from Translation 

sent down to the Workflow Pipeline. resources 1003. The packet is converted back to the work 

The Scheduler module 805 updates the Project Schedule P ackct format 1004 and the status and control information in 

810 and notifies the Accounting Manager module 809 during me work P acket fe u P dated 1005 ^ Manager's Console is 

the course of the project. Tnis permits the Project Schedule men notified of ^ work P acket ' s availability 1006 in the 

810 to be adjusted when there are changes in the availability 30 translation queue. 

of Translation Resources, or when manual changes are made ^ of flexibility allows new translation technolo- 
to the project by an operator via the Manager's Console. ^ es and translation service providers to be quickly inte- 
The Accounting Manager module 809 updates customer S rated mto me workflow process. Additional resources are 
and Translation Resource accounting in the Accounting casil y dlocated to accommodate sites which require f re- 
Database 811 as jobs complete. The Accounting Manager 35 quent update and synchronization. 

module 809 generates project cost estimates (based on input Referring to FIG. 11, a task view of the Adaptor is shown, 

from the Project Analyzer module 806), handles customer ^ Convert to Translation Resource module 1101 converts 

billing, and is responsible for generating purchase orders for work packets to the appropriate Translation Resource format 

Translation Resources, when needed. that K understood by the resource. The packets are then 

Completed documents and their constituent or dependent 40 to the Translation Resource by the Route Packet 

components that have been checked by the Web site man- module ll02 * 

ager are then installed in one or more language and country Packets are received by the Receive Packet module 1103. 

databases and/or file systems 808 by the Monitor Database ^ <^vert/Update Packet module 1104 performs the 

Documents module 807. packet conversion back to the work packet format and 

The Workflow Pipeline is a transport layer that delivers 45 thc packet's status and control information. It then 

documents and their constituent or dependent components notifies mc Manager's Console that the work packet is 

requiring translation to the Translation Resources where the available in the translation queue. 

actual work is performed. The Workflow Pipeline is heavily The Workflow Pipeline together with the Translation 

instrumented so that status and tracking of ongoing work is Resources allow the translation process to proceed rapidly 

instantly conveyed to the Web site manager on the Manag- 50 ^ efficiently. 

er's Console. Translation Resources are centralized or distributed func- 

The Workflow Pipeline uses a highly configurable archi- tional blocks which are connected to the Workflow Pipeline 

tecture allowing a variety of Translation Resources to be where me actual work > including translation, is performed. 

instantly available to the Web site manager. These Transla- A l yP lcal workflow process will use one or more Translation 

lion Resources include both automated translation tools as 55 Resourccs in a specified sequence to complete the required 

well as human translators. work. 

Translation Resources are connected to the Pipeline using A Translat ">n Resource performs translation or other 

an open Application Programming Interface (API) known as ^guistic functions on a set of mput documents and their 

an Adaptor. Adaptors allow a variety of Translation constituent or dependent components and produces a set of 

Resources to be connected to the Pipeline, making it easy to 60 documents that have been transformed linguistically. 

tailor the Workflow Manager to meet a customer's require- . ^ Workflow Pipeline connects to any type of Transla- 

ments. The flexible Pipeline architecture can be deployed on tion Resource . 

intranets as well as the Internet. Four examples of Translation Resources are: 

Referring to FIG. 9, Generators 901 are provided that Machine translation systems 

convert the internal format document that is used within the 65 Translation memory systems 

Pipeline to the appropriate format required by the Transla- Human translators 

tion Resource. The Adaptors 902 invoke the specific gen- Terminology management systems 
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Machine translation is software which provides fully When a work packet is dispatched to a Translation 

automatic, draft-quality translations. Resource 1207 for processing, the packet first flows from the 

Translation memory systems provide a database of translation queues 1205, through the Pipeline Adaptor for 

source/target franslation pairs for a particular document. mat Resource it ^ averted from the Workflow's 

When an updated version of a document needs to be < ■ fo t e , - . _ # , . , 4 

translated, the translation memory can be used to pre- £ temal format mt0 one <** be prOCeSSed b * mat 

translate the document with translations from the previous esource. 

version of the document. ° nce * e Translation Resource 1207 has fimshed its 

Human translators are linked to the Workflow Pipeline processing, the packet which contains the work completed 

through a special Adaptor that connects to the Internet or a in bv the Tra °slation Resource 1207 flows back through the 

customer's intranet. This allows any translator, anywhere in Adaptor where it is re-converted back into the Pipeline's 

the world, to join a virtual translation team and provide internal format The Adaptor is also responsible for analyz- 

translation services to multilingual Web sites or any other m S the completion status of the Translation Resource and 

translation application serviced by the invention. updating the packet's control and status information accord- 

Terminology management systems provide functions that m gly* 

are necessary to support the entire translation process, such A sim P llfied sample of a typical workflow sequence is 

as automated generation and retrieval of client-specific shown below. 

glossaries and lexicons. Both human-based as well as 1* Machine Translation: automatically translate the 

technology-based Translation Resources require access to untranslated (i.e. new) sections of the document, 

appropriate glossaries and lexicons to ensure that linguistic 2Q 2. Translation Memory: pre-translate the document using 

transformations are performed in the correct terminological saved source/target translation pairs from the previous 

domain. version of the document. 

Technology-based Translation Resources are easily 3. Human Translation: post-edit the sections of the docu- 

upgraded as their technology improves. For example, older merjt mat were machine translated and review the entire 

machine translation or translation memory systems are 25 document for quality 

replaced with newer systems simply by unplugging the old 4 Terminology Management: obtain the correct lexicons 

systems and plugging in the new ones through the Adaptors * , . * j * 1 1 

onmeWorkflowHpeline. |^ gl<«sanes requjred to properly apply the other 

™. , . , a, t „. « . A translation Resources, 

The Manager s Console provides the Web site manager 0 - 4U „. . , . 4 A , 

•♦u * 11 * a * c *i_ , 4 ? Since the Pipeline 1206 is heavily instrumented, the 

with folly automated management of the entiretra^non 30 console 1202 gives the Web Site manager 1201 continuous 

process. The followmg is a desenpuon of how the Web s.te ^ m tbe ^ oftbe ^ Console 1202 

man^ermteracUwjt^eManager'sCo.isolemmeprocess £ ^ staras information to automatically provide 

°ur? 1D8aS l t 11 L ^ ' , . . 4 , „ dynamic dispatch and control decisions to achieve the fastest 

With respect to FIG. 12, when the site is current and I aU ^ most Ztettexivt ^onn* time, 

language content is update, the Manager s Console 1202 35 ^^^otte workflow sequence, the translated 

r eP uSd manager °° 15 docunie » te ™ back to the Corlole 1202 where the 

i f o . - . , . Web site manager 1201 is notified of their completion. From 

If a document m the master language is subsequently f , _ gf *j . , 

»~a«**a u * c a * a. * • r *■ ,l there, the Web site manager 1201 can directly install the 

updated-^jerhaps out-of-date product information on the . , . , 6 f . ' 4 . . 

w^u o • * a »u /-» i i<w» ii • • j. . translated documents back onto the active area of the Web 

Web site is being updated — the Console 1202 will immedi- fi1 4 , 4 ^ , 4 . ^ . 

w v i<wm .u ♦ .u j- 40 Slte 1203 > fi! e system (local or remote), or database (local or 

ately alert the Web site manager 1201 that the corresponding M . u ' . ,. , > u „ " , v , t 

* . ? *u j ♦ : c a • remote) at the click of a button, or can make them available 

^skss? *• document are ou "" ate fo h r *r* T e ^ r UB *• For r^ 16 ' 

i? • * *u u u i v i_ when the translated documents are installed onto the active 

From this point, the Web site manager simply clicks on a , ,i w , „ v , , , . 

u««^ * *u V» i » i-^A'* « *».»* w area of tne Weo Slte 1203, they are referenced by the 

button, using the Console's 1202 "one-touch" translation AC nwwm - f „ tj™t ' ^ iha i^h 

f . . . ii . i * 4S appropriate language HTML commands 1210, 1211, 1212, 

feature, to automatically miUate the re^ranslaUon and 12 \ s ^ the Web sile ' ^ d ^ 

^^u^° amC mw?lT n 18 ^ k franslated «> ntent fe «W"oved from the translated documents 

from the Web site manager 1201 until the documents have and ^ ^ m ^ databasc 

b^n translated and are ready to be mounted back onto the New knguages m ^ added to ^ Web ^ ^ w<jb 

m „. . .u . i.- 50 site manager simply needs to select the new language from 

When the Web site manager 1201 initiates the translation . „,„ . „ ... ... , . . . 6 * 

process, the newly updated master language document Tt **** f^T^ ^ 

together with associated control information will be con- ^3' *. n vn£2 ? ^.•"T TT^ ^ * aUt °" 

verted by the Console 1202 to an internal format, one which '^^^JS^^.^JT^ 

• . ( - ^ „. . a The invention provides the Web site manager with an 

is more suitable for information transport over the Workflow ee ^«x^w c ^ f • + j* 

d: !• iOAjC rpi j , * * i 55 effortless way of keeping the site up-to-date. 

Pipeline 1206. The converted document together with rel- . mi < • + , .1* . *n • ♦ *t- * m. u 

, *ij**. r *• • i * One skilled m me art wiU reaoily appreciate that, although 

evant control and status information is known as a work t * t_ j u j • *u * ^ c u 

ti,. n: i- • ui 4: mc invention has been described m the context of Web sites, 

packet. The Workflow Pipeline 1206 is responsible for t , 4 r , v „ . ' 

A . , , 4 . * ♦ * r th c same concept apphes to other applications. For example, 

sending and receiving work packets to an appropriate set of - a r n ^5 r ^ , ^ , , , ^ / 

T,«™if*™ t,_, i^fto nn i u *u in a rax apphcation, documents are scanned and sent through 

Translation Resources 1207, 1208, 1209 where they are cn • f _ * , , . + , 4 , . 

orocessed mvention for translation mto selected languages. The 

™~ • i • » rp , n translated documents are then automatically faxed to 

The sequence in which the Translation Resources are , - , , ..^ A . . , , J . . 

j l 4 . „. j . , seJected locations. Another example would be a universal 

engaged by the Pipeline is pre^eterrm^d according to: where ^ ^ ^ , ^ 

Settings previously established by the Web site manager stream ^ ^,,1 througn ^ mvention for transUtiwi into 
Subject matter of the content 65 designated languages. The translated data stream is con- 
Target language of the translation verted back into audible speech and sent to the designated 
Availability of a particular Resource audiences. 
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The invention is scalable to provide multilingual pre sen- To access the Web server via WebPlexer, you would type 

tation and management functions across multiple servers in a URL of the form: 

a distributed environment For example, some sites might http://www.mysite.com:iiii/ 

have English content on one server, Japanese on another, and where iiii is the IntemetPort assigned above, 

so on. Each server will have installed on its respective site, 5 Normal requests of the form: 

an instance of the invention communicating with other httpV/wwwmysite com/ 

instances of the -invention throughout the system. One server ^ go edn ^ y {Q me Web bypassing WebPlexer. 

is designated the master, and the others are slaves for the Y ou can make WebPlexer available to ordinary port 80 

purpose of managing content. The servers communicate users , even m test mode , b y including a special link on a 

over a dedicated interface allowing content to be managed in 10 page to ^ a request t0 WebPlexer's port, 

a distributed fashion. This configuration also supports mir- p or example 

rored sites | across multiple servers. <A HREF.«http://ww.mysite.com:3080/webplexer/ 

One skilled m the art wrtl readily appreciate that, although index.html">WebPle X er test</a> 

a smgle user system is specifically mentioned, the same . . , , , , . 

* ,. u . . 7 , 1 , could be included on a site's home page, 

concept applies to multi-user systems, for example, a multi- is * * * i» j. *• 

c i j i • * r t * * 3.1.1.2 Production Mode 

user configuration would be appropriate for an Internet T _ A „, , n1 . « j . 

o • t\ a ncm *u * * . • | . ... In production mode, WebPlexer is configured to nrocess 

Service Provider (ISP) that serves multiple customers, with n u „ „ ♦ u <- L *u , „, , 4 „. 

* *it_ a. tA- i v r . i . . -jr all ntt P requests before they reach the Web server. This 

and without multilingual sites, from a single instance of the „ ■ ,. r U j „*cv**j i*j 

& ' b would normally be done once off-line tesfang was completed 

invention. j * L •» j . * * j 

™ c 1t 4 - 1 j l- and the site was ready to go into production. 

Tie following is an excerpt of a user manual describing 20 To ^ prod ^ n B mode: 

the Viator and Developer modules for a preferred emoodi- _ „ . „ 

ment of the invention called "WebPlexer": ^ t0 80 , M L htt P k*! 1 "** ^ 1x5 P™" 

cessed by WebPlexer. Note that this will typically mean 

3. DETAILED FUNCTIONAL DESCRIPTION ^at WebPlexer must run as a privileged process. 

25 2. Set ServerPort to an unused port. This port will be 

The following sections are ordered beginning with basic dedicated to WebPlexer <— > Web server communica- 

functions and gradually progressing to those that are more t j on 

advanced Similar functions have been grouped together 3 Recoilfigure Web „m so it is connected to 

logically for easy reading. ServerPort. This is typically done by editing your 

3.1 Communication Ports 30 f^f *tjpdxonf ^ b * the configuration method is 

determmed by your specific server, so please check the 

WebPlexer has 3 network ports. These ports are used to documentation that comes with your Web server, 

communicate with the client, the Web server, and the con- Normal requests of the form: 

sole. The ports can be assigned to any available port number http://www.mysite.com/ 

on your system and are specified in WebPlexer's 35 will then go to WebPlexer for initial processing. 

WP_Config file. You can still access the Web server directly, with a URL 

The 3 ports are: of the form: 

Internetport. This is the port WebPlexer uses to commu- http://www.mysite.com:ssss/ 

nicate with the client. where ssss is the ServerPort assigned above. 

ServerPort. This is the port WebPlexer uses to commu- 40 312 Setting IntemetPort 

nicate with the Web server. IntemetPort is set by editing the line: 

ConsolePort. This is the port WebPlexer uses to commu- . I nteraetPort 

nicate with a client that is being used for the console. m me WP — Con ^S file* 

Please see Section 3.10, "WebPlexer Console," for A , ^ can ako be overridden by a command line 

more information on console operation. ^ Please «« Section 311 > "Runtime Options," for more 

Port numbers are assigned starting at 1. Lower numbers information. 

are reserved for dedicated processes like ftp(21), http(80), 313 Settia S ServerPort 

and telnet(23). The lower numbered ports are also privileged ServerPort is set by editing the line: 

in that they require root-level authority to use. For testing, 50 ServerPort ssss 

we recommend choosing an available port with a number of m me WP_Config file. 

at least 3000. This setting can also be overridden by a command line 

3.1.1 Operating Modes flag- Please see Section 3.11, "Runtime Options," for more 

WebPlexer's IntemetPort and ServerPort are configured information, 

to support two modes of operation, 'test mode* and 'pro- 55 3 -l-4 Setting ConsolePort 

duction mode*. ConsolePort is set by editing the line: 

3.1.1.1 Test Mode ConsolePort ccc 

In test mode, a separate access path is created for clients in the WP_Config file, 

to send requests to WebPlexer. Normal http requests to port This setting can also be overridden by a command line 

80 and the existing Web server are not affected. This permits 60 flag. Please see Section 3.11, "Runtime Options," for more 

off-line testing of WebPlexer functionality until the site has information, 
been validated. 

To configure test mode: 8 s 

1. Set IntemetPort to an unused port number. This will be WebPlexer supports 3 types of logs: 
used for all incoming WebPlexer requests. 55 AccessLog 

2. Set ServerPort to 80. This will allow WebPlexer to ErrorLog 
connect to your existing Web server. InformationLog 
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The filenames of these logs are defined in the WP_Config 
file. Additional information on how to define the log files can 
be found in Chapter 4, "Contents of WebPlexer Files." 

NOTE: A fourth log, the Usage Log is generated by the 
updtusage tool that analyzes the Information Log. This tool 
will be included in a future release. 

The function of each log is described below. A detailed 
specification can be found in Section 4.4, "Log Specifica- 
tions" 

3.2.1 AccessLog 

The AccessLog records information about each request 
received by WebPlexer, such as date, time, and request type. 
WebPlexer command requests (WPCommands) are also 
logged. 

NOTE: When WebPlexer is configured for "production 
mode" (see Section 3.1.1.2, "Production Mode"), WebPlex- 
er's Access Log should be used in place of the Web server's 
access log for analysis and auditing purposes (the Web 
server's access log will show all requests originating from 
WebPlexer). 

Examples of AccessLog entries are: pp93.satcom.net — 
[28/Jul/1996:23:37:23-0700] "GET/cnv.post/info.html 
HTTP/1.0" 302-pp93.satcom.net— [28/Tul/1996:23:37:24- 
0700] u GET/cnv.post/english-american__United-States/ 
info.html HTTP/1.0"200 3372 pp93.satcom.net— [28/Jul/ 
1996:23:37:29-0700] "GET/cnv.post/coming/flags/United- 
States.gif HTTP/1.0" 200 3594 

3.2.2 InformationLog 

The InformationLog contains detailed information about 
WebPlexer's language selection processing for each request. 
Examples of InformationLog entries are: 
prune.loria.fr— [26/Jul/l 996:05:44:43-0700] Country 

Selection: prune.loria.fr-»France 
prune.loria.fr— [26/Jul/l 996:05:44:43-0700] Language 

Set: prune.loria.fr-»'{french-continental, french} 
prune.loria.fr-H[26/Jul/1996:05:44:43-0700] Language 

Selection : prune.loria.fir-*French-continental 

3.2.3 ErrorLog 

The ErrorLog shows any errors that occurred during 
WebPlexer operation, or during the processing of a request. 
Examples of ErrorLog entries are: 
[23/Jul/1996:03:35:40-0700] FATAL: Could not send 

request to HTTP server!! ! 
[23/Jul/l 996:04:03:34-0700] Timed out when writing 

response to client 

3.2.4 UsageLog 

The UsageLog contains a summary of country and lan- 
guage statistics. It is generated from the InformationLog 
using the usage log analysis tool The UsageLog can be 
further processed using the visits log analysis tool. Please 
see section x.xx for a description of WebPlexer's log analy- 
sis tools. 

Examples of UsageLog entries are: 
[17/Oct/1996:13:40:ll-0700] Language: German 

www07.btx.dtag.de 
[17/Oct/1996:14:09:57-0700] Country: Argentina 

serverl.srmnter.com.ar 

3.3 Language Selection 

WebPlexer's Language Selection feature allows the most 
appropriate language of a requested document to be served 
to a Web browser. 

"Most appropriate language" means a language that has 
been: 

specified as a preference by the Web browser 
automatically selected by WebPlexer from the country of 
origin 



manually chosen by the user 
specified by a cookie 

These three language selection modes all work together 
seamlessly. They do not require the addition of any special 
5 CGI scripts or links to be added to your Web site. 

The language selection algorithm requires that each lan- 
guage be assigned a unique directory in the document tree. 
All documents in that language are placed in the correspond- 
ing directory. The entire document tree then becomes a 
10 collection of parallel language directories. 

For example, if user 'Joe' has english, trench, and Spanish 
documents on your site, the directory structure might look 
like this: 
htdocs/ 
15 joe/ 

english/ 

index.html 
info.html 
french/ 
20 index.html 
info.html 
Spanish/ 
index.html 
info.html 

25 WebPlexer's language selection algorithm works by 
inserting the correct language directory in the URL before 
sending the request on to the Web server. 
For example, an incoming URL of: 
30 http://www.lai.com/5oe 

would send a modified URL of: 
http://www.lai.com/joe/french 
to the Web server, if the browser's preferred language were 
French. 

3.3.1 The Document Matrix 

A group of language directories and the documents they 
contain can be likened to a matrix. This matrix has one row 
for each language, and one column for each document in the 
40 collection. When expressed in this form, the collection is 
known as a document matrix. 

In the example just given, the document matrix might 
look something like: 



35 







index, html 


info .html 




english 


X 


X[001b] 




french 


X 


X 


50 


Spanish 


X 


X 



where the *X' indicates the presence of a document in that 
language. 

55 WebPlexer's exclusive document management feature 
known as "SPAMM" (Sparse Matrix Manager) does not 
require each document to be translated in every language, 
and therefore does not require every cell of the document 
matrix to be filled in. 

50 In addition, SPAMM also permits templates to be placed 
only in the default language directory (see Section 3.3.23, 
"Specifying the Local Default Languages") and eliminates 
the need for them to be instantiated in every directory. 
SPAMM offers an incremental approach to building a 

65 multilingual Web site. Please see Section 3.9.2, "SPAMM 
(Sparse Matrix Manager)," for more information on this 
feature. 



06/24/2003, EAST Version: 1.04.0000 



US 6,526,426 Bl 

17 18 

3.3.2 Setting Up Language Selection <charset> 

WebPlexer's Language Selection feature is enabled by is the encoding used for documents in that language (e.g. 
editing a few lines in the WP_Map file, located in the main 1SO-8859-1) 

WebPlexer directory. Once the file has been set up, any of In the previous example, these lines would read: 
the language selection modes (browser-specified, automatic, 5 english: english ISO-8859-1 

manual) are available. french: french ISO-8859-1 

The information that needs to be entered in the WP_Map Spanish: Spanish ISO-8859-1 

file is: 3.3.2.3 Specifying the Local Default Languages 

1. The top-level URL of your site. The local default languages tell WebPlexer which lan- 

2. The list of languages on your site and the names of the 10 to use in case the requested language is not available 

directories where the documents are located, on J~ e sen/er * , , , r , , 

n ™ . A , . , Jnere are two local default languages that need to be 

3. ^eenraduissmwhKh the documents are represented specified. The first is called "others" and the second is 
(e.g. lbU-8859-1) simply calkd « defau i t » 

4. Default directories in case the requested language is not The "others" language is chosen when the requested 
found (usually English). language is valid but does not exist on the server. 

These four pieces of information together make up what For example, if English, French, and Spanish documents 

is known as a "URL group" declaration. exist on the server, but the requested language is Russian, 

Often a single server will serve multiple sites. You can WebPlexer will use the language defined as "others", 

easily have WebPlexer support multiple sites by creating a Q The "default" language is chosen when WebPlexer cannot 

separate URL group for each site. determine the language of a request and none has been 

The next sections explain each component of the URL specified by the browser. This can happen when the country 

group declaration. Please refer to the example WP__Map file of origin cannot be determined because the hostname is 

supplied with this release while reading the following sec- unavailable, for example, when the requester is behind a 

tions. firewall. 

3.3.2.1 Specifying the Top-Level URL 25 These directories are specified by including the following 
The top-level URL is specified by adding lines of the form lines in the WP__Map file: 
URL::<path><path_type> others: <others_Janguagexcharset> 

to the WP_31ap file. default: <default_languagexcharset> 

where 30 It is usually the case that "others" and "default" are both 

<path> set to English, 

is the directory path of the top-level URL below the htdoc 1° me example above, these lines would read: 

root. others: english ISO-8859-1 

<path-type> default: english ISO-8859-1 

is either 'd* or T depending on whether the path is a 35 The next sections describe the language selection algo- 

directory or a file. rithms in more detail 

NOTE: In the current implementation, when a directory is ^33 Browser-Specified Language Selection 

specified, it is also necessary to specify the corresponding In browser-specified language selection, the browser 

top-level file as well (usually index.html). sends its language preference along with the http request. 

In the previous example, there would be two lines read- 40 The language is set through a preference or option setting in 

ing: the browser. The preference is sent to WebPlexer in the 

URL::/joe d "Accept-Language" header within the request message. 

URL::/joe/index html f ^ Accept-Language header is described in the http 

Note that this file tfoe/index.html) does not actually exist in Variation. ^ specification allows the browser to send 

the document hierarchy but should match the URL the user 45 more ^ °f the hf ^er ^e group of lan- 

will type when entering the site. 15 ordered ^ the most language first. 

If the selected language were French, the URL mappings uses Acoept-Language header to select the 

would be* correct language. The header is only used when the request 

, H „ , . . . is to a URL that matches the top-level URL given in the 

http7/www.lai.com/joe-http://www.laixom/joe/french ^ wp_ Map file (sce 421> ^ pecifymg ^Top-Level 

and URL"). 

http://wwwaai.conyjoe/index.html^http://www.lai.com/ For example, if the top-level URL were /joe, and the 

joe/french/index.html browser preference were set to Spanish, a request to: 

3.3.2.2 Specifying the Language-to-Directory Map and http://www.lai.com/joe/ 

^\ j. , , 55 would cause the URL to be mapped to: 

^^f*^™? ^arset (document http://www.lai corn/joe/spanish 

encoding) are specified by adding lines of the form: and ^ tQ the Wcb J^J 

<language>: language directory><charset> If mc browser has dfied more ^ onc languagc> 

immediately after the URL specificanon. WebPlexer goes through the languages in order and sfop 

w erc 60 when it finds the first one that is present on the server. If 

<language> none Q f me languages m found, WebPlexer uses the "oth- 

is the name of the language. Please see Appendix B., "List ers" language specified in the WP_Map file (see Section 

of Available Languages," for a list of which languages are 4.2.2.1, "others:") 

available for this release. Once the language has been determined, WebPlexer redi- 

<language_directory> 65 rects the browser to the directory for that language. All 

is the name of the subdirectory containing documents of that accesses to relative links within the requested document stay 

language. within that directory. 
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Changing the browser's language preference does not http://www.lai.com/joe/english_United-States/ 

have any effect until the top-level URL is again requested WPCommand?language=french 

and the browser re-enters the site. The recommended way to which will map to 

change language within a language directory is by manual http://wwJai.coniyjoe/&ench_United-States/xxx.html 

selection (see Section 3.3.5, "Manual Language Selection"). 5 where xxx.html is the current document (the one that cbn- 

Browser-specified language selection has the highest pri- tains the WPCommand link). Note that WebPlexer will 

ority upon entry to the site. return the browser to the original document in the new 

3.3.4 Automatic Language Selection language, in this case, French. 

In automatic language selection, WebPlexer determines The manual language selection feature is performed com- 

the language by first determining the country from the pletely by WebPlexer, without the need for CGI scripts or 

requester's domain, then looking up the language for that complex interlinking of HTML documents. It merely 

country in a database. requires the addition of a link in the HTML to send the 

Automatic language selection is only active under the WPCommand to WebPlexer for each desired language, 

following two conditions. . The WPCommand hypertext link can be used like any 

to ZKwJ? tV ^ iT ^l 1 "^ 1S other HTML link, that is\it can be placed within a table, 

WP_Map file (see Section 3.3.2.1, "Specifying the Top- 15 . , . « K u . . . ' 

Level URL") connected to a graphic or icon, or built into an imagemap. 

2. The browser has not set a preferred language (see Section ^3.6.2 Using WI^^and?link 

3.3.3, "Browser-Specified Language Selection")- nc mcthod dcscribcd abovc works weU whcn mc numbcr 

NOTE: If the browser has specified a language preference, of languages is small. However, as the number of languages 

it overrides any automatic language selection. 20 on me t*™ 1 increases, there may not be enough space to 

For example, if the top-level URL were /joe, and the host have me links for the languages appear on each page, 

domain were eiffel.tower.fr, a request to: In this case* a generic mechanism exists for handling 

http://www.lai.com/joe/ WPCommands that will move all of the language selection 

would cause the URL to be mapped to: links off to a separate page. Once the links are moved, a 

http://ww.lai.OTm/joe/french 25 hypertext link that points to this special language selection 

and sent to the Web server. page is placed on each page of your site. 

The country-to-language database contains a list of lan- Because of the need to return to the originating page when 

guages for each country. These languages are arranged in the language is changed, WebPlexer requires that a special 

order of frequency from most-used to least-used. WPCommand be used to invoke the link to the change 

This list of languages is processed in the same way as 30 language page, 

preferred languages are handled when received from a This WPCommand is of type 'link' and takes the name of 

browser (see Section 3.3.3, "Browser-Specified Language the page to link to as an argument. Its external behavior is 

Selection"). This means that the languages are examined the same as if a direct HTML link were used, except it 

until the first one present on the server is found. notifies WebPlexer that a change language (or change 

For example, the languages of Switzerland are {german, country) WPCommand is about to be issued, 

french italian, romans^ This allows WebPlexer to return the browser to the 

The rules for selecting the local default languages are the origmating pagc oncc me language has changed, 

same as for browser-specified languages, if the language IntoeJHTML, this link would be codedls 
cannot be round. 

3.3.5 Manual Language Selection <A HREF-^CoiruBand?link=langsel.html»>Change 
Manual language selection is effective at any time. 40 Language </A> 

Manual language selection overrides any automatic or and would produce a request URL to WebPlexer of the form 

browser-specified language. http://www.lai.com/joe/english United-States/ 

Manual language selection is performed by issuing a WPCommand?link=langseLhtml 

WPCommand whose type is "language". WPCommands are where langseLhtml is the language selection page, 

more fully described in Section 3.6.3.3, "WPCommands." 45 3.3.7 Hostname and EP Address Overrides 

They result in the browser sending a request to WebPlexer WebPlexer provides a method of optionally forcing the 

of the form: selection of specific language(s) for certain hostnames or ip 

http://www.lai.com/joe/xxxx/WPCommand71anguage- addresses. 

yyyy This feature overrides automatic language selection when 

where xxxx is the current language (phis country) directory 50 the top-level URL specified in the WP_Map file is 

and yyyy is the desired language to switch to. requested. A browser-specified language preference will still 

NOTE: language (plus country) means that the language be used if specified. 

directory is appended with the currently selected country There are two files used to specify automatic language 

(e.g. english_United-States) selection overrides: HostNameMap and IPAddrMap, both of 

Note that even with manual language selection, the lan- 55 which are in WebPlexer's language subdirectory, 

guages and directories in the WP__Map file still need to be The format of these files is described in Section 4.3, "Map 

set up as in automatic language selection. Overrides.*' 

3.3.6 Cookie-based Language Selection 3.3.8 Setting the Global Default Language 

3.3.6.1 Using WPCommand?language The global default language is set in the WP _31ap file 

WPCommands are normally issued through hypertext 60 with the directive: 

links embedded in an HTML document. This statement defaultLanguage <language> 

illustrates a simple example, and is normally the same as the local default language 

<A HREF="WPCommand?language=french'*>French</ specified in the URL group definition. Specifying the local 

A> default language is described in Section 

If the user is currently looking at an English document, 65 3.3.2.3, "Specifying the Local Default Languages." 

clicking on the word 'French' will cause the browser to send The global default language is used as the default for the 

WebPlexer a request for the URL console and for some WebPlexer initialization. 
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3.3.9 Content Tagging and Support for Multilingual Brows- and which will fetch the German version of the info.html 

ers document. 

Content tagging allows WebPlexer to inform the browser 3.4.3 Cookie-based Country Selection 

of the language and charset used to encode a document when 3.4.4 Declaring Worldwide Regions 

the document is accessed. Some browsers, like Netscape 5 Regions are denned in the WP_Regions file. Regions 

Navigator, use this information to automatically switch to allow groups of countries to be combined into geographical 

the correct font and encoding for a particular language. areas that are meaningful for the type of content that is on 

Content tagging is always active whenever a document is the site, 

fetched from a language directory that is specified in the WP For example, if a company had one sales office in Europe, 

Map file. 10 and another in Asia, and had developed Web site content that 

When a document is content tagged, the hup Content- was specific to these markets, it might want to create two 

Language and Content-Type response headers are returned regions like this: 

with the document to indicate the language and charset in Sales_regionl: France, Germany, Italy, Spain 



use. 



Sales_region2: China, Hong-Kong, Japan 



These fields are automaticaUy updated by WebPlexer 15 Regions can be eithcr ^^^^ or pre-defined 

even when the language is changed by manual selection. (default). User-defined regions are specified in the 

_ . WP_Regions file (see Section 4.9, "WP_Regions"). 

3.4 country Selection Pre-defined (default) regions are established by Web- 

WebPlexer provides three ways by which the country of Plexer. Please see Appendix D., "List of WebPlexer Default 

an incoming request is determined. 20 Regions," for more information. 

Automatic country selection Regions are used by the multi-country server-side include 

. , , * . feature (see "Multi-Country Server-Side Includes (MCSSI) 

Manual country selection „ } ^ allow content to be presented to ^ from 

Cookie-based country selection countries in those regions when they enter the Web site. 

These are described in detail below. 25 345 Setting the Global Default Country 

3.4.1 Automatic Country Selection dc£ault fa ^ m the wP_Map file with 
With automatic country selection, WebPlexer determines the directive: 

the country from the requester's domain. If the domain defaultCouutry <country> 

cannot be determined (usually because the requester is whefC <country>is one of mc valid ^ m 

behind a firewall), me country is set to the global default 30 ^ ^ c ^ of Available Countries." 

country (see Section 3.4.5, "Setting the Global Default ^ dcfault ^ ^ ^ me dcfault for the 

Countrjr). console and for some additional WebPlexer functions. 

Automatic country selection is only active when the 
browser request is to the top-level URL specified in the 3-5 Country and Language Management 
WP_Map file. See Section 3.3.2.1, "Specifying the Top- 35 WcbP i excr has the unique ability to independently main- 
Level URL," for a detailed description of how to set the ta m both the country and the language associated with each 
top-level URL. Once the country has been determined, its request This provides a great deal of flexibility in selec- 
value is saved by appending it to the language directory in tively targeting content to specific regions or countries of the 
the mapped URL. world and, at the same time, making that content available 

For example, if the top-level URL were /joe, and the host 40 m mu |tiple languages, 

domain was eiffeLtower.fr, a request to: It ^ important to understand how WebPlexer's country 

http://www.lai.com/joe/ management and language management are different and 

would cause the URL to be mapped to: how they are similar. 

http://www.lai.com/joe/french_France/ ^ In the following description, the term "session" refers to 

By embedding the country name in the URL, WebPlexer a logical grouping of requests between browser and Web- 
can retain the country setting for the lifetime of this session. Plexer. A session begins with the browser making a request 

3.4.2 Manual Country Selection for the top-level URL. It ends with the final response being 
Manual country selection is performed by issuing a sent from WebPlexer to the browser. A session can have any 

WPCommand of type "country*'. WPCommands are more number of request/response pairs between these two end- 
fully described in Section 3.633, "WPCommands." They points. 

result in the browser sending a request to WebPlexer of the A session has a state associated with it. Included in this 

form: state are the country and language of the requester. 

http://www.lai.com/joe/xxxx/WPCommand7country~ Once the country and language state have been deter- 

yyyy 55 mined from the initial request, they remain unchanged until 

where xxxx is the current language directory appended to the browser initiates an action like manually changing the 

the current country, and yyyy is the desired country to switch country or language. 

to. Once the country changes, the language will also change Unfortunately, the http protocol used for data transport on 

to match the new country. the Web is a stateless protocol. This means that the protocol 

For example, if the current country is France and the ^ does not allow state to be maintained from one request to 

current language is French, and the current document being another. 

viewed is info.html, to request that the country be changed WebPlexer uses a unique persistence mechanism for 

to German, this request should be sent to WebPlexer: retaining session state across multiple requests. This mecha- 

http://www.lai.com/joe/french_France/ nism allows country and language information to be saved 

WPCommand?country=germany 65 throughout the lifetime of a session. 

WebPlexer will respond with a new URL that looks like WebPlexer uses a simple 4-step algorithm for managing 

http://www.lai.com/joe/german Germ any/info, html country and language when processing a request: 
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1. Determine country. feature. This feature is more fully described in "Multi- 

2. Determine language from country. Language Server-Side Includes (MLSSI)." Additional 

3. Select content for country. information can be found in Section 3.6.1, "Template 
a w j-r * i Model," which describes WebPlexer' s template model. 

4. Modify content for language. „ „, / . . 

Once the country and language for a request have been 5 3 " Words ^ *^nces that are marked up with Web- 
established, they are available as state variables within the n ™ l ' s WPReplace tags are dynamically "translated" 
HTML and can be used for: kto request language using an external TermDB. 

j fy | . . « 4 . t WPReplace tags and TermDBs are described in Section 

1. Content selection based on country, region, or lan- 3 6 3 * "WPReplace," and Section 3.6.2, "TermDBs 
guage. 10 (tM) „ 

2. Word and phrase replacement out of an external glos- 
sary (this glossary is referred to as a TermDB). 3 6 Multilingual ToolKit (tM) 

3. Setting language and country-specific "custom vari- 
ables" such as country flag, language name, and coun- The Multilingual ToolKit is a set of HTML extensions and 
try name. is templates that simplifies the design and construction of 

4. Sorting a list of items according to sort rules of the multi-language and multi-country Web pages. 

currently displayed language. The Multilingual ToolKit consists of several key Web- 

WebPlexer's 4-step algorithm for country and language Plexer resources: 

management is described in more detail below. i. a template-based model of page construction. 

3.5 1 Determine Country m 20 2. External, multilingual glossaries known as TermDBs. 

WebPlexer will first determine the country of the _ A , - T ™„ „ . , 

requester if it is not known (on an initial requester when 3 ' , A of ™- e * a *>w 

it is manually selected (through a WebPlexer change country lan f a f t0 ™**>l a document s 

command (see Section 3.4.2, "Manual Country Selection")). content. 

3.5.2 Determine Language from Country 25 4 - Built-in support for manually changing language and 
WebPlexer uses the country from the previous step to country, eliminating the need for special CGI scripts or 

determine the slanguage of the requester. A list of languages complex interlinked pages. 

for the selected country is obtained from a database and the 5. Custom variables to facilitate customization of a docu- 

first one that matches a language present on the server is ment according to the current language and country, 

selected. 30 g. Custom functions to change the appearance of a page 

For example, if the requestor's country were Switzerland, according to language and country, 

the requestor's language could be set to either german, 7. Server-side includes to allow a document to be built out 

french Italian, or romansch depending on which one was of language and country-specific component parts, 

found first on the server. ^ foll owing sections discuss the specific components of 

The exceptions to this rule are: *s ^ Multilingual ToolKit in more detail. 

1. When the browser specifies the language explicitly in 3.5.1 Template Model 

its preferences (see Section 33.3, "Browser-Specified WebPlexer's Multilingual ToolKit uses a template-based 

Language Selection"). approach to dynamically create documents tailored for a 

2. When the language is overridden through a HostName ^ specific language or country. 

or IPAddress map (see Section 3.3.7, "Hostname and IP a template contains placeholders for country and 

Address Overrides"). language-specific information that has been removed from a 

3. When the language is manually selected through a document. This information is dynamically inserted from a 
WebPlexer change language command (see Section TermDB, another template or document, or provided auto- 
3.3.5, "Manual Language Selection"). Once country 4S matically by WebPlexer when the composite document is 
and language are determined, the following rules apply. presented to the browser. 

3.5.3 Select Content for Country The basic approach to creating a template has two steps. 
WebPlexer uses the country to dynamically select content First, remove as much country and language-specific infor- 

specific to that country. This allows the content to change mation as possible from a document. Then, replace this 
based on the geographic location of the requester. The actual 50 information with appropriate WebPlexer tags and corn- 
selection mechanism is controlled by the multi-country mands. 

server-side include feature. This feature is more fully The advantage to using templates is that a single docu- 

described in "Multi-Country Server-Side Includes (MCSSI) ment can easily support many languages and countries. 

." Additional information can be found in Section 3.6.1, For example, a single form or CGI-generated document 

"Template Model," which describes WebPlexer's template 55 can be constructed so that it will be automatically localized 

model, for different languages and locales. 

5 3.5.4 Modify Content for Language 7^ significantly reduces the' number of documents that 

WebPlexer presents the content in the language selected have to be maintained on the site and makes it very easy to 

by the requestor. add new languages. It also allows a single update to a 

There are 3 methods that WebPlexer uses to present 60 document to be immediately propagated to all languages and 

content in the selected language: countries. 

1. A static document that has been pre-translated is This technique also allows country or regional content to 
selected from the corresponding language directory. be conveniently separated from the structure of the 
For example, if the language is French, the document document, making it easy to change a design often by 
will be retrieved from the french directory. 6S updating a single file. This eliminates the ripple effect often 

2. Content that has been pre-translated is inserted into a seen when a single change must propagate through all of the 
template using the multi-language server-side include localized documents on the site. 
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A 3.6.1.1 Template Types 
^ WebPlexer supports three basic kinds of templates. 

1. Static (no template). 

2. File-based. 

3. Tag-based. 5 
One document may contain all three template types. We 

call such a document a "composite" document, since it is 
made up of multiple elements. 

Most documents will have a mixture of these three basics 
types. In the sample included with this release, the file 
'info. html' is a combination of static and tag-based 
templates, whereas the template of 'contact.htmr is com- 
pletely tag-based. 

Please look through the sample files in the htdocs direc- 
tory of this release for examples showing the use of tem- 
plates. 

File-based templates require the server-side include fea- 
ture. This feature is not available in the current release. 

Tag-based templates are the most generic and allow a 
single document to be used across all languages and coun- 20 
tries. 

In general, the smaller the amount of static content in a 
template, the more generic the template is and the fewer the 
number of times it needs to be instantiated. 

The next sections discuss the template types in more 
detail. 

Static 

Static templates are not really templates at all. They are 
simply collections of static text. 
File-based 

File-based templates allow external files containing local- 
ized content such as country, regional, or language-specific 
information to be inserted dynamically into a document. 
File-based templates are provided using the server-side 
include feature described in Section 3.6.3.5, "Server-Side 
Includes." 

Tag-based 

Tag-based templates allow replacement of marked-up text 
with translations from an external TermDB according to the 
selected country or language. Tag-based templates are pro- 
vided using the WebPlexer HTML extensions described in 
Section 3.63.1, "WPReplace." 
3.6.2 TermDBs (tM) 

A TermDB is an HTML representation of a multilingual 
glossary. The glossary is built in the form of an HTML table. 
Rows in the table correspond to entries in the glossary, be 
they words, phrases, or arbitrary strings. Columns in the 
table correspond to individual languages in the glossary such 
as English, French, or German. 

One language in the TermDB is designated the primary 
language and is assigned to column 1 of the table. This 
primary language is used as the language of the key when 
looking up entries in the TermDB. 

The TermDB is used together with the WPReplaceBegin 
and WPReplaceEnd tags. (See Section 3.6.3.1, 
"WPReplace"). The tags enclose text that will be replaced 
by a TermDB entry. The text must be in the primary 
language. 

The following example shows how the word "hello" 
would be translated using the TermDB. 
Sample HTML: 

<!--WPReplaceBegin«>hello<!-WPReplaceEnd-»> 
Sample TermDB: 
english hello 
french bonjour 
Spanish hola 
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When WebPlexer accesses the TermDB, it first finds the 
word 'hello' by looking in row 1 (the primary language row) 
of the TermDB and finding the column that contains the 
desired term. Then it retrieves the term from the row of the 
table corresponding to the current language (row 2, in the 
case of French) in the column it identified as containing the 
desired term. 

This release of WebPlexer comes with a standard TermDB 
that includes the names of all languages and countries 
translated into 25 languages. 

WebPlexer supports the use of multiple TermDBs. Each 
top-level URL declared in the WP_Map file can be asso- 
ciated with a different TermDB, and multiple TermDBs can 
be defined for a given URL. This makes it possible to 
maintain several small TermDBs rather than one large one. 
There are three types of TermDBs: 
The System TermDB (included with the product) 
Add-on TermDBs (available from Language Automation) 
User-defined TermDBs (created with the tpbuilder utility, 

explained in the next section). 
TermDBs, together with WebPlexer's WPReplace tag, 
allow multilingual terminology to be centralized. This has 
several advantages: 

It makes it easy to add support for new languages without 

changing any of the documents on the site. 
Changes to the form or structure of a document can be 
made immediately available in all languages. All lan- 
guages can be updated simultaneously. 
The number of documents that need to be maintained on 
the site is reduced. It is not necessary to have separate 
versions of a document for each language. 
A single CGI script or form can be localized for multiple 
languages from a single copy of source code. No update 
is required when new languages are added. 
TermDBs and the other resources of the Multilingual 
ToolKit can be used most effectively if you plan appropri- 
ately before you design your site. If you need assistance in 
building your multilingual site, please contact Language 
Automation and one of our site support engineers will be 
happy to assist you. 

3.6.2.1 Building User-defined TermDBs 
User-defined TermDBs can be created and updated using 
the tpbuilder utility that comes with WebPlexer. 
tpbuilder has the following syntax: 
tpbuilder <controlFilc><outputTermDB>or 
tpbuilder 

<controlFUe><mputTermDB><outputTennDB> 
The first form is used when creating a new TermDB. The 
second form is used for updating an existing TermDB. 
controlFile has the following syntax: 



# comments 
<TermDB title string* 
<TermDB table caption* 
<TcrmDB text body> 
<b)ank lino 

<masterLanguage> <encoding> <input-file> 
<languagel> <encodingl> <input-filel> 
<Janguage2> <encoding2> <dnput-file2> 
<language3> <encoding3> <input-fiJe3> 
<language4> <encoding4> <input-fiJc4> 



65 



<languageN> <encodingN> <input-fileN> 

where 

<TermDB tide string> 
is the title string in the generated HTML file. 
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<TermDB table caption> 
is the caption of the generated table. 

<TermDB text body> 
is the string inserted before the actual table and is part of the 
text body. 

Note that these strings are for informational and readabil- 
ity purposes only and are not used by WebPlexer. 
A line of the form 

<lang^ageN><encodingN><input-fileN><conirnent> 
specifies the target language, encoding and input file that 
contains the list of strings to be entered into the TermDB. 
The first line must specify the TermDB's master language. 
The master language is used to index into the TermDB using 
a key. The key is obtained from the string contained between 
< ! -WPReplaceBegin~>and < !-WPReplaceEnd— >tags. 
NOTE: the control file and input files must be in the same 
directory. 

For example, a typical control file might look like this: 

Joe's TermDB 

This is the TermDB for Joe 

Joe's TermDB 



english 
chincac 
f reach 



ISOS859-1 

GB2312 

ISO-8859-1 



laieng.txt 
laichs.txt 
laifre.txt 



In this case, english is defined to be the master language 
of the TermDB and the list of index terms will be taken from 
the file laieng.txt The corresponding translations into arabic 
and trench will be read from the files laiarah.txt and 
laifre.txt, respectively. 

The input files have the following syntax: 

1 : <TermStringl > 

2: <TermString2> 



N: <TermStringN> 
where <TermStringN>is a string to be entered into the 
TermDB for that particular language. Note that the numbers 
at the beginning of the line are arbitrary and are used to 
easily align the text across each of the input files. 

For example, a typical set of input files might look like 
this: 
English 
l:hello 
2:goodbye 
French 
l:bonjour 
2:au revoir 
Spanish 
l:hola 
2:adios 

When tpbuilder is run, it will write status information to 
stdout. 

Updating an Existing TermDB 

An existing TermDB is updated using this form of the 
tpbuilder command: 

tpbuilder 
<controlFile><inputTennDB><outputTermDB> 

NOTE: The controlFile must contain all of the languages 
that were specified when the TermDB was initially created. 

Generally speaking, there are three types of update opera- 
tions that can be performed on a TermDB. 
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Adding new entries (and optionally their translations) 
Replacing old entries (and optionally their translations) 
Deleting entries 

These operations are controlled through the contents of 
the individual input files specified in the controlFile and are 
described in more detail below. 

Adding an Entry 

To add a new entry, simply add a new TermString to the 
master language input file and optionally, corresponding 
TermStrings in the target language input files. If a matching 
translation is not found in a target language input file, an 
empty entry will be created and will be filled in later when 
the translation is available through the WebPlexer console's 
translation management feature. Please see section x.xx for 
further information on console operation. 

If an empty TermDB entry is accessed during a lookup, 
the TermDB will return the value of the original master 
language string (i.e. the index). 

NOTE: Prior to updating a TermDB, it is recommended 
that TermStrings that will not change be removed from the 
input files (i.e. input files should only specify actions to be 
performed during the update). 

Replacing an Entry 

To replace an entry, simply replace old translation^) in 
the target language input files with new translations of the 
TermString. Hie original TermString must be specified in the 
master language input file. 

If new translations are not available, simply put a blank 
line in the target language input file(s) with a number that 
matches the original TermString in the master language 
input file. The translations will be updated later through the 
WebPlexer console's translation management feature. 

Deleting an Entry 

To delete an entry, simply put a minus sign ('-') in front 
of the line in the master language input file corresponding to 
the entry to be deleted. The master language entry for that 
TermString along with all translations will be removed from 
the TermDB. 

Including HTML Tags in a TermDB Entry 
HTML tags can be included in TermStrings to allow 
complete HTML constructs to be embedded between WPRe- 
place tags. This provides a great deal of freedom in content 
organization. 

Embedded HTML tags must be preceded by a backslash 

CV) 

For example, the following is a valid TermString: 
25:<img src= /balloon -gif J\> 

The backslash character can also be used as a valid 
character in the TermString but it must be preceded by 
another backslash. 
3.6.3 HTML Extensions 

The Multilingual ToolKit's HTML extensions are a set of 
special-purpose commands and comment-embedded HTML 
tags. They enable WebPlexer to provide special country and 
language processing when serving a document. 

The extensions consist of 

1. WPReplace for replacing a text string with its transla- 
tion from a TermDB. 

2. WPSort for sorting lists according to the sort rules of 
the currently selected language. 

3. WPCommands for providing the change language/ 
change country function without the need for CGI 
scripts or interlinked pages. 

4. Custom Vars for dynamically inserting language name, 
country name, and country flag into the HTML. 

5. Server-side includes for building documents out of 
country and language-specific components. 
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Each of these is described in detail in the following 
sections. 

3.63.1 WPReplace 

The WPReplace function replaces a string with its trans- 
lation. 5 

The string to be replaced and the currently selected 
language are used as search keys within a TermDB. The 
TermDB is accessed and returns the translation of the string 
in the specified language. 

The WPReplace function is implemented with two tags: io 

<!--WPReplaceBegin-*> 

<!»WPReplaceEnd-»> 

The string to be replaced is enclosed between the tags: 
<!--WPReplaceBegin-->String to be replaced s 
<!-WPReplaceEnd-»> 

The string must be in the primary language of the Ter- 
mDB (see Section 3.6.2, "TermDBs (TM)"). 

For example, if a TermDB contained these entries: 

20 

english hello 
french bonjour 
spanish hola 
then, this HTML code 

< ! ~WPReplaceBegin">hello< ! -WPReplaceEnd— > 
would produce 
bonjour 

if the current language were French, and 30 
hola 

if the current language were Spanish. 
Please note the following: 

1. If the string is not found in the TermDB, it is not 
replaced. 

2. The WPReplace commands are removed after replace- 
ment. 

Please see the sample file 'contact. html* included with this 
release for an example of WPReplace. 40 

Please see Section 4.23, "outbound:: Enabling the Output 
Manager," for information on how to specify TermDBs. 

3.63.2 WPSort 

The WPSort function sorts a list of items according to the 
sort rules for the current language 45 
NOTE: this is restricted in the current release. Please see 
Section 1.3, "Known Bugs and Limitations of This Release." 

The text to be sorted is divided into a series of lines, each 
of which contains a key. 

line 1 (sort key-'box'): < . . . box . . . > 50 

line 2 (sort key^'ball'): < . . . ball . . . > 

line 3 (sort key-'car'): < ... car ... > 

After sorting, the lines will be arranged such that the keys 
will be in the correct sort order. 

line 1 (sort key >»' ball'): < . . . ball . . . > 

line 2 (sort key«'box'): < . . . box . . . > 

Line 3 (sort key«*car'): < ... car ... > 

The WPSort function is implemented with five tags: 

<!--WPSortBegin-» 60 

<!~WPSortEnd-» 

<!--WPSortFieldBegin-*> 

<!--WPSortFieldEnd-»> 

<!--WPSortLineEnd—> 65 
<!--WPSortBegin-->and <!--WPSortEnd~>surround the 
entire block of text to be sorted. 
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<!--WPSortFieldBegin->and <!-WPSortFieldEnd- 
>surround the sort key on each line. 

<l— WPSortLineEnd— >delimits adjacent lines. 
NOTE: The first line must start with <!--WPSortBegin- 
>and the last line must end with <!-WPSortEnd-»>. 

WPReplace tags can be included within a sort field, and 
modify a sort key, if desired. 

In the above example, the HTML would be coded as 

<!--WPSortBegin-»> 

«! ~WPSortFieldBegin-->box 

<!-WPSortFieldEnd-->. . . x! --WPSortLineEnd— > 

< . . . <!-WPSortFieldBegin«>ball 
<!»WPSortFieldEnd->. . . x!--WPSortLineEnd-*> 

< . . . <!-WPSortFieldBegin~>car 
<!~WPSortFieldEnd«>. . . > 
<!--WPSortEod-*> 

WPSort can be used to sort a list of languages or countries 
in local-language order. 

For example, consider this bulleted list of languages: 
<!--WPSortBegin-»> 

<Li><A HREF="WPCommand?language=english"> 
<!~WPSortFieldBegin-» 

<!~WPReplaceBegin~>English<!-WPReplaceEnd— > 
<!-WPSortFieidEnd-x/A>[English] 
<!-WPSortLineEnd— > 

<LIxA HREF="WPCommaod?language=frencb"> 
<!-WPSortFieldBegin-*> 

<!-WPReplaceBegin-->French<! -WPReplaceEnd-»> 

<!-WPSortFieldEnd»x/A>[French] 

<!-WPSortLineEnd-»> 

<LIxA H REF-"WPCommand?language-germaQ"> 
<!-WPSortFieldBegin-*> 

<!-WPReplaceBegin">German<!"WPReplaceEnd-» 

< !-WPSortFieldEnd-x/A>[German] 
<!-WPSortEnd^> 

This code will allow the list items to be sorted in the 
correct order, according to the current language. 

In the above example, the WPReplace tags are processed 
first, replacing the words "English", "French**, and "Ger- 
man" with their translations. The translations will then be 
used as the sort keys. 

All WPSort tags are removed before sending the result to 
the browser. 

3.6.3.3 WPCommands 

WPCommands are special WebPlexer commands 
included in a document's HTML. They allow WebPlexer to 
manage country and language selection within a document. 

WPCommands are coded as hypertext links in a docu- 
ment. When a link is clicked, the WPCommand is sent to 
WebPlexer in the request URL. 

When WebPlexer receives the WPCommand, it performs 
some function and returns the result to the browser. 

WPCommands have the form: 

WPCommand?<command>»<arg> 
where <command>is one of 

1. link 

2. country 

3. language 

and <arg>is a function of <command>. 
Each of these commands is described in more detail in the 
following sections. See also Section 3.3.5, "Manual Lan- 
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guage Selection,** for a discussion of how WPCommands are For example, if the current language were German, this 

used within the HTML. sequence 

Unk <!--WPReplaceBegin-->&language<!-- 

WPCommand?link-<link-target> WPReplaceEnd-^> 

The link command makes a hypertext link to the URL 5 would be converted to 

specified by <link-target>. This command has the side effect Deutsch. 

of notifying WebPlexer that a hypertext link was executed. &fl ag 

This information is used to save the location where the &fl ag & a placeholder for the name of the gif file 
browser came from, so it can return after country or lan- containing the flag of the currently selected country. The 
sel ^ t [ on - 10 name is automatically prepended with 'flag/'. The Web- 
When WebPlexer receives a link notification, it saves the pi eX er release includes gif images of flags from the countries 
current value of the referer URL in the http request for this 0 f the world. 

session. After a change country or change language com- For example, if the current country were Germany, this 

mand is executed (see next sections), the saved referer URL sequence 

is restored and the browser returns to the page from where 15 ^- „ /• _/ , u,™ in* « n 

the link was issued. <ung src = "A^age/<!-WPReplaceBegm«>&flag 

The link command is used when the change country or <!~WPReplaceEnd-->" height=30> 

change language function is implemented on a separate would ^ 0°™^ to 

page. This allows the browser to return to the originating <ulI f> sro"../image/flags/Germany.gif * height-30> 

page once the new country or language is selected Please 20 3.6.3.5 Server-Side Includes 

see Section 3.3.6.2, "Using WPCommand?link," for addi- An ordinary server-side include is an HTML tag which 

tional information. causes an external file to be included within an HTML 

An example of the link command is included in the document. The file is included when the document is served 

sample documents included with this release. The change to me browser. 

country and change language functions are implemented in 25 WebPlexer's server-side include extends this capability 
the files 'countseLhtmr and 'langsel.html*, respectively. by making the decision of what file to include based on the 
These pages are reached by issuing a WPCommand?link= current country, region, or language. This enables a docu- 
<link-target>from the index and form pages. ment 10 t> e bmh dynamically out of country and language- 
Country specific components. 

WPCommand?country=<newKX)untry> 30 WebPlexer supports two kinds of server-side includes: 

The country command changes the current country to 1- Multi-country server-side includes (MCSSI) 

<new„country>. 2. Multi-language server-side includes (MLSSI) 

The current language may also change depending on the MCSSI permits locale-specific elements of a document to 

session state. (See Section 352, "Determine Language from be included as a function of the current region or country. 

Country**). 35 MLSSI permits localized elements of a document to be 

Language included as a function of the current language. 

WPCommand?language=<new-language> Multi-Country Server-Side Includes (MCSSI) 

The language command changes the current language to The MCSSI feature provides the ability to dynamically 

<new_Janguage>. insert country and regional-specific content into the HTML 

3.6.3.4 Custom Variables 40 stream. 

WebPlexer's custom variables are special placeholders MCSSI is invoked when this construct: 

that can be included in HTML code. They are automatically <!-#WP_inchide_MCSSI "file_name**-*> 

replaced by current country and language information when is detected in an HTML document being served to the 

the document is served to the browser. ^ browser. 

Custom variables begin with and must always be When WebPlexer detects this command, it will insert the 

enclosed within file 

<!-WPReplaceBegin-->and <!~WPReplaceEnd->tags. <head_of_mcssi_directory>/<country_or_region_ 

The next sections describe the custom variables supported directory>/<nMe_name> 

in this release. Please see the sample files (e.g. contacthtml) 50 in-line into the HTML stream, 

for examples of their use. Where: 

Acountry head_oL^mcssi_directory is the top-level directory of 

Acountry is a placeholder for the translated name of the we MCSSI ^ on the and fc defined ^ 

currently selected I country. WP_Config file (see Section 4.1, "WP Config," for 

s*leJted D 1™ °° 15 mt ° ^ CUnendy 55 more information) and country_or_region is deter- 

anguage. . , „ mined dynamically according to the algorithm 

For example, if the current country were "United-States" described in "Determining the country_or_region_ 

and the current language were Spanish, this sequence directory ** 

<!--WPReplaceBegin-->&country<!-- The file which is included by MCSSI can itself contain 

WPReplaceEnd-*> 60 additional WebPlexer tags (such as WPReplace and Custom 

would be converted to Variables, see Section 3.6.3.1, "WPReplace," and Section 

Estados-Unidos. 3.6.3.4, "Custom Variables'*). It can also contain additional 

&language MCSSI and MLSSI includes (see "Multi-Language Server- 

Alanguage is a placeholder for the translated name of the Side Includes (MLSSI)'*). 

currently selected language. 65 By supporting WebPlexer tags such as WPReplace within 

The name of the language is translated into the currently the included MCSSI file, regional-specific content can be 

selected language. further localized according to language. This feature allows 



06/24/2003, EAST Version: 1.04.0000 



US 6,526,426 Bl 

33 34 

considerable flexibility in serving specific content for a Then a primary search would look for the file in: 

region or country where more than one language is spoken. /usr/locaVetc/httr^/htdocs/mcssi/Japan/xyz.html 

Please note that language-dependent content is normally b. Secondary: 

not included in MCSSI files and that template tags (see If the file is not in the primary location, the secondary path 

Section 3.6.1.1, "Template Types") such as WPReplacc (see 5 will be checked: 

Section 3.6.3.1, "WPReplace") or MLSSI (see "Multi- <head_of_mcssi_directory>/<current_jregion>/ 

Language Server-Side Includes (MLSSI)") should be used <mcssi_file> 

instead. where current__region is either the user-defined or Web- 

Language-dependent content means any static text that is Plexer pre-defined (default) region for this country. 

present in the file. Including language-dependent content 10 Example: 

within an MCSSI file is not recommended since an MCSSI Assume a statement of the form: 

file is included based on the setting of the current country. <!-- #WP_inchide_MCSSI "xyz.htmr"-*> 

In other words, MCSSI files should contain only country or anc * 

regional-specific content and be independent of language. current country is Japan 

For example, if the current country of the browser is 15 current_region is Pacific-Rim 

Switzerland, the user might be viewing the site in either head_of_mcssi_directory is <docroot_directory>/ 

French or German. If an MCSSI file for Switzerland is mcssi. 

included that contains static text in French, the language will a secondary search would look for the file in: 

be wrong if the browser is requesting German. /usr/local/etc/httpd/htdocs/m 

This problem can be easily solved by replacing the static 20 c - Tertiary: 

text in the MCSSI file with template tags, either by . If me file * not me secondary location, the tertiary path 

1. Placing the translations of the static text in a TermDB, will be checked. 

and using WPReplace tags in the MCSSI file, or <head oOn^d^ctor^/<mcssi_file> 

„ ™ . . i • r , . 5. If the mcssi_file is not found in either the primary, 

2. Placing each language version of the text m a separate secondary, or tertiary locations, the file is considered to 
file and including it with a single MLSSI tag within the 23 ^ « not found » the following message will be 
MCSSI file. injected into the HTML stream in place of the MCSSI 

Determining the country_or_region_directory directive: 

The country or region directory is determined dynami- [an error occurred while processing MCSSI directive] 

cally from a combination of the following parameters: Multi-Language Server-Side Includes (MLSSI) 

the browser's currently selected country 30 MLSSI allows the Web site designer to further customize 

the user-defined region in the WP_Regions file for the M, 11 H ™ L l 0 ^^^^^.^^' T™. "T*" 

, , i cally than what can be provided by the WPReplace tags (see 

selected country 3 fi3 ^ u^p^^ 

the WebPlexer default region for the selected country Like MCSSI, MLSSI allows a file to be included in-line 

where the file is located (i.e. whether it is in the primary, 35 by referencing it within an HTML document. However, the 

secondary,or tertiary directory as defined below). path to the file is determined by the currently active 

The specific algorithm for determining the count ry__or language, rather than by country or region. 

region directory from the above information is as follows. The referenced file is pre-translated and placed in each of 

1. The browser's current country is obtained. See Section &e language d^tories referenced in the WP Mapfile (see 
i c ur> j r _ \ M *»r*r 40 Section 4.2, WP_Map, for a description of the WP___Map 
3.5 Country and Unguage Management, for infor- filc) mML d^^ 

mation on how this is done. mdudcd by MLSSI ^ rctrieved from thc directory 

2. Hie WP_Regions file is consulted to determine if this corresponding to the currently active language and inserted 
country is contained within a user-defined region. See in-line into the HTML stream. This allows a single HTML 
Section 4.9, "WP __Regions, n for a definition of the 45 document to be easily customized by including components 
syntax of the WP_Jlegions file. that are specific to the selected language of the browser. 

3. If there is no user-defined region for this country, the MLSSI is invoked when this construct: 
WebPlexer pre-defined region is used instead. See <!-#WP include MLSSI "file name"-*> 

Appendix D., "List of WebPlexer Default Regions," for is detected in an HTML document being served to the 

a list of pre-defined regions. 5 0 browser. 

4. WebPlexer will look in 3 locations for the target MCSSI When WebPlexer detects this command, it will insert the 
file. These 3 locations are known as the primary, ^ e 

secondary, and tertiary MCSSI directories, respec- <language__directory>/<file_name> 

uvely. in-line into the HTML stream. 

These directories are consulted in order, one at a time until 55 Where: 

the requested file is found. The directories are defined below. language_directory is the directory specified in the 

a Primary: WP_Map file and language is determined according to 

<head__of_„mcssi__directory>/<current-country>/ me ****** outlined in Section 3.5, "Country and Lan- 

<mcssi_file> g^ge Management." 

Example: 60 The ^ e w hich is included by MLSSI can itself contain 

Assume a statement of the form: additional WebPlexer tags and commands (such as 

i mm - u a hjh-oct « u* i» WPReplace, see Section 3.6.3.1, "WPReplace"). It can also 

<!~#WP__inchide„MCSSI "xyz.html ^ . r 1urcci . w PCO , . * * , ' 

J contain additional MLSSI and MCSSI includes. 

^ If the MLSSI filename is not found on the server, the 

current country is Japan 65 following message will be injected into the HTML stream in 

head_of_mcssi_directory is <docroot_directory>/ place of the MLSSI directive: 

mcssi. [an error occurred while processing MCSSI directive] 
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3.7 Multilingual Navigator (TM) 3.9 Document Manager 

The Multilingual Navigator is a collection of WebPlexer- WebPlexer's document manager manages the selection of 

enhanced HTML code which can be used to easily provide documents to serve based on the current country and lan- 

language and country selection for any document on the , » ttat document? be placed in parallel 

Web site. It provides complete language and country selec- 5 ^"T^ ^ server ^ s ,^ "P* » 

... ... ., X„, .7 , • . i- , , also requires that the location of language directories be 

tion without the need for CGI scripts or complex interlinked specified in the WP Map fik. 

v ^ * The following sections provide additional information. 

The Multilingual Navigator is easily customizable to meet 3 9 #1 Arrangement of Documents on the Server 
the specific navigation needs of your site. 10 The document manager requires that translated docu- 

The Multilingual Navigator is included with this release ments be arranged in parallel language directories off of the 

in a file called 'mlnav.html*. It is also included on the Web server's htdoc root. Please see Section 3.3, "Language 

index.html and contact.html pages in the sample directory. Selection," for detailed information on how to arrange the 

document hierarchy for the document manager. 

3.8 Input Manager 15 392 SPAMM (Sparse Matrix Manager) 

WebPlexer's exclusive document management feature 

WebPlexer's input manager allows forms-handling CGI known as "SPAMM" (Sparse Matrix Manager) provides an 

scripts to be intercepted and user input extracted for later incremental approach to building a multilingual Web site, 

translation. It can be used to support the translation of SPAMM allows the site administrator to translate only a 

customer feedback forms in a multilingual environment ^ portion of documents on the site, and into only those 

To use the input manager, the name of the CGI script to 2 ° languages that are most relevant 

be intercepted is specified in WebPlexer's WP__Map file SPAMM is a feature that makes it unnecessary to fully 

along with the location of a template file and an output P°P ulate me site ' s l*W matrix completely with docu- 

(.prt— "pre-translated") file. m6 "*: t . , t u _. , , . . 

vr . ' If the set of documents on a multilingual Web site is 

There is one line at the end of the WP_Map file for each 2s visualized as a matrix, with each row representing a lan- 

CGI script to be intercepted. The format of this line is: guage and each column representing a document, SPAMM 

inbound:: <cgi-name><template-file><.prt file> eliminates the need to populate every cell of the matrix. 

Please see the sample WP_Map file included with this When a document is requested in a particular language, 

release for an example. WebPlexer will go to that cell of the matrix and try to 

The template file tells the input manager which fields of 30 retrieve the corresponding document, 

the form should be extracted. When the CGI script is If me document is not there (i.e. it has not been translated 

invoked, the input data from those fields is saved and ^ mc requested language), WebPlexer will automatically 

appended to the output, or .prt (pre-translated) file. This file retrieve it from the default language directory specified by 

also contains the time stamp, current country and language, "default::" declaration in the WP Map file for this 

and domain of the requester! 35 top4evel URL (see Section 42 "WP_Jtfap,» for an expla- 

3.8.1 Template Definition to ^ m * default d ^ t0 T 7>; 

rp» . . . 1 * *i 1- *i_ * n • * + SPAMM is completely transparent in operation. It does 

in P ut mana S er tem P late file has the foUowmg format: not require ^ gpe S al or configuration. It also works 

<FIELD-LIST> with MLSSI so that if an included file cannot be found in the 

<fieldl>::L current language directory, it will look in the default direc- 

<field2>::L 40 tory instead. 

Please note that even when SPAMM fetches a document 
from the default directory, WebPlexer still maintains the 
correct settings of country and language. This means that if 
a browser clicks on a link from within a document that was 

<fieldn> *L 45 relcne< * fr° m the default language directory (through 

1 i<5T FNm SPAMM), WebPlexer will look for the new document in the 

<USI END> currently active language directory, not in the default lan- 

where <FIELD__JJST>and <LIST_END>are required mgc ^ctoiy 

tokens and the remaining lines list the field names used in 393 Specifying Language-to-Directory Mapping 

the HTML form. The <::L* at the end of each line is required. 50 ^ mapping 0 f language to directory must be specified in 

For example, if an mput field in an HTML form were ^ W P_Map file so the document manager can locate 

as: documents in the correct language. 

<INPUT TYPE="text" NAME-«phone" SIZEo40 Please see Section 3.3.2.2, "Spedfying the Language-to- 

VALUE~""> Directory Map and Charset," for detailed information on 

the matching line in the template file would be 55 how to specify this mapping, 

phone ::L 3,9.4 Specifying Charsets 

Please see the files 'contact.html' and 'contacttmpl' Please sec Section 3.3.2.2, "Specifying the Language-to- 

includedin the sample directory of this release for additional Directory Map and Charset," for detailed information on 

examples. how to specify the charset used to encode a document. This 

3.8.2 .prt File Description 60 information is used by the document manager. 

The .prt file is the output file that contains data from the 3.9.5 Using Multi-Country Server-Side Includes (MCSSI) 

extracted fields specified in the template file. MCSSI allows an HTML document to be easily custom- 
Each entry of the file is preceded by the time stamp, ized with country and regional-specific content using a 

current country and language, and host domain. highly modular, "building-block" approach. This reduces 

The .prt files can be queued by the console for later 65 the number of individual files that need to be kept on the 

translation. Please see Section 3.10.10, "Maintaining Input server, lowering the cost and time required to maintain the 

Manager Database Files," for more information. site. 
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One application of MCSSI is to insert a targeted, country- a. Place country-specific content in these directories: 

specific banner at the top of an HTML page. It can also be <head_of_mcssi directory>/<country>/<mcssi_file> 

used to easily tailor a marketing message for a specific where: 

country or region. For example, a travel company could use <head_of_mcssLjdirectory> 

this feature to publish different airfares to various countries 5 is the top-level directory of the MCSSI area on the server 

or regions. Or, tour packages or travel information could be and ^ defined in the WP_Config file (see Section 4.1, 

presented only to those countries or regions where it is "WP_Config," for more information), 

relevant. . <country> 

_ . n . . . , , r TTm „ , is the name of the desired country. 

The following is a simple example of an HTML document 1Q <mcss j fii e> 

that shows how MCSSI is used. is the target file to be included. 

<HTML> b. Place regional-specific content in these directories: 

<BODY> <head_of_jncssL_directory>/<region>/<mcssL_file> 

<!-WPReplaceBegin->Welcome to the ABC Travel 15 W < n ead_of_mcssi_directory> 

^ tc! is the top-level directory of the MCSSI area on the server 

<!-WPReplaceEnd-*> and is defined in the WP_Config file (see Section 4.1, 

<!--#WP_inchide_MCSSI "airfares.htmr-*> "WP_Config," for more information). 

</BODY> • <rc S ion> 

20 is the name of the desired region. This is either a user- 
</HTML> defined region from the WP_Regions file, or a WebPlexer 

When this document is processed, WebPlexer will first pre-defined region given in Appendix D., "List of WebPlexer 
determine the current language and replace the string "Wei- Default Regions." 
come to the ABC Travel Site" with its translation out of the <mcssi_file> 

TermDB. Please see Section 3.6.3.1, "WPReplace" for a 25 fa tne tw 8^ t ^ to be included. 

description of WPReplace tags and how the replacement is c - Place generic content (all regions and countries) in this 
done. Note that this is not part of the MCSSI operation but directory: 
is included to make the example a bit more realistic. where: 

Next, WebPlexer will process the MCSSI statement as <head_of_mcssi_directory> 

follows: 30 is tDC top-level directory of the MCSSI area on the server 

1. The currently selected country is determined. Please see * nd ^ defined in the WP„Config file (see Section 4.1, 
Section 3.5, "Country and Language Management," for "WP_Config"). 

an explanation of how this is done. . <mcssi-file> 

2. The appropriate region that this country belongs to is fe thc tM BP t file t0 bc included. 

determined using user-defined regions from the 35 4. Include a statement in your HTML of the form: 
WP_Regions file and/or WebPlexer predefined (default) <!-#WP_jnchide_JtfCSSI "airfares.html"-^> 
regions. Please see Section 4.9, "WP_Regions," and whcrc "airfares-html" 1S the name of the target MCSSI file. 
"Multi-CountryServer-SideIncludes(MCSSI)-formore 5 * men ^SSSf^ ^m? ? ^ WhC ° 

information about bow regions are used. M processing the HTMU it wdl fetch the file from the appro- 

3. Based on the analysis of step 2, the target file, 40 

« u* m * * . w j ^ .t. ?• 'it first), and insert it in-line into the HTML stream, 

airfares, html , is fetched from the appropriate directory 6. Any WPReplace or WebPlexer Ugs that are found in the 

on the server. Please see "Determining the country_or„ mwmbc ^ ^ inserted. 

region_directory," for a description of how this directory Please note mat language-dependent content is normally 

is determined. 4S not included in MCSSI files and that template tags (see 

4. The target file, "airfares.html" is inserted in-line into the Section 3.6.1.1, "Template Types'*) such as WPReplace (see 
HTML stream. Any WebPlexer tags (e.g. WPReplace) Section 3.6.3.1, "WPReplace") or MLSSI (see "Multi- 
that are included within the file are subsequently pro- Language Server-Side Includes (MLSSI)") should be used 
cessed. instead. 

Note that the included MCSSI file can contain additional 50 Language-dependent content means any static text that is 

WebPlexer tags, including MLSSI and WPReplace tags. present in the file. Including language-dependent content 

This allows the MCSSI file to be further customized for the within an MCSSI file is not recommended since an MCSSI 

current language, by replacing strings with their translations file is included based on the setting of the current country, 

using the TermDB. In other words, MCSSI files should contain only country or 

MCSSI is very easy to use by following the steps given 55 regional-specific content and be independent of language, 

below. For example, if the current country of the browser is 

1. Determine which countries or regions are important for Switzerland, the user might be viewing the site in either 
your site, based on the country-specific or regional- French or German. If an MCSSI file for Switzerland is 
specific content you wish to serve to visitors to your site. included that contains static text in French, the language will 

2. Group countries together into those regions of interest and 60 be wrong if the browser is requesting German. 

edit the WP_Regions file to identify those regions to This problem can be easily solved by replacing the static 

WebPlexer. Please see Section 4.9, "WP_Jlegions," for text in the MCSSI file with template tags, either by 

information on the syntax of the WP_Regions file. 1 . Placing the translations of the static text in a TermDB, and 

3. Create country and region directories on your server as using WPReplace tags in the MCSSI file, or 

shown below, and place in those directories, any country- 65 2. Placing each language version of the text in a separate file 

specific or regional-specific parts of a document you wish and including it with a single MLSSI tag within the 

to have managed by MCSSI. MCSSI file. 
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3.9.6 Using Multi-Language Server-Side Includes (MLSSI) 

MLSSI is typically used to customize an HTML docu- 
ment by including HTML that might be too complex to be 
managed by WPReplace tags. For example, MLSSI can be 
used to include a language-specific heading at the top of a 5 
form constructed out of WPReplace tags. 

Please see Section 3.6.3.1, "WPReplace," for further 
information about WPReplace tags. 

MLSSI also helps to separate the form of a Web site (the 
structure and layout of the site) from its content (the actual 10 
message to be displayed on the browser). 

The design and layout of a multi-language site is usually 
very similar, and often invariant, across languages. MLSSI 
allows the language<lependent parts of the content to be 
isolated from the structural design making it easy to update 15 
the design of the site and have changes reflected automati- 
cally across all of the languages. Likewise, MLSSI allows 
language content to be easily updated and maintained with- 
out affecting the overall site design. 

Another application of MLSSI is to include language- 20 
specific graphics or image maps within a template file. 

For example, a form-based HTML document that is a pure 
template can be customized with a language-specific banner 
and graphic at the top of the file as follows: 

<HTML> 25 

<BODY> 

<! --#WP_inchide_MLSSI "banner.html"^> 
<INPUT NAME -"name" ><!«WPReplaceBegin- 

>name<! — WPReplaceEnd-*> 
<INPUT NAME -"address" ><!«WPReplaceBegin- 

>address<! -WPReplaceEnd-*> 
</BODY> 
</HTML> 

When this file is processed, the file "banner.html" will be 
fetched from the language directory specified in the 
WP_Map file (i.e. frencrVbanner.html, german/banner.htmL, 
etc.) 

The contents of the banner.html files might look some- 
thing like: 

french/bannerhtml: 

<IMG SRC="images/french/bannergir WIDTH- 
80HEIGHT=20>Bienvenue<br> 
german/banner.html: 45 
<IMG SRC-"images/german/banner.giT WIDTH- 
10OHEIGHT«20>Wilikommen«*r> 
and so on. 

Of course, the WPReplace strings ("name", "address") 
will also be replaced by their translations from the TermDB 59 
during processing of the file. See Section 3.6.3.1, 
"WPReplace," for a description of how this is done. 

As can be seen, MLSSI can be used to separate out the 
language-specific components of an HTML document mak- 
ing it easier to maintain the collection. In the previous 55 
example, if the structure of the HTML form needed to 
change, it would only require changing the template file, and 
not any of the language-specific files on the site. 

MLSSI is very easy to use by following the steps given 
below. 60 

1. Place any language-specific parts of a document you wish 
to have managed by MLSSI in the respective language 
directories specified in the WP _Map file. 

2. Include a statement in your HTML of the form: 
<!--#WP_Jnclude_JtfLSSI "component.html"-*>where 65 

"component .html" is the name of the file created in 
step 1. 
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3. When WebPlexer detects the MLSSI include when pro- 
cessing the HTML, it will fetch the file from the language 
directory, as defined in the WP_Map file, corresponding 
to the current language selection, and insert it in-line into 
the HTML stream. 

4. Any WPReplace or WebPlexer tags that are found in the 
inserted file will be resolved after the file is inserted. 

3.9.7 Using CGI Scripts 

CGI scripts producing HTML output should be written to 
be as independent of language and country as possible. This 
means these scripts should use the HTML extensions pro- 
vided by the Multilingual ToolKit. They should write out 
templates that do not contain language-specific or country- 
specific information. Having only one script to maintain 
means that changes can be made easily and efficiently. 

When the HTML output of such a CGI script is presented 
to the browser, WebPlexer will fill in the template dynami- 
cally with current language and country information. Text 
that is marked-up with WPReplace tags will also be trans- 
lated. 

In order for WebPlexer to correctly process a CGI script, 
the filename of the script must be changed when it is 
referenced in an HTML document. The filename must not 
include the path to the cgi root, even though the script is 
actually located in that directory. 

Id addition, the cgi root must be specified in the 
WP_Cbnfig file (sec Section 4.1.18, "cgiRoot"). 

For example, suppose that the cgi root is "/cgi-bin". 

Normally, a reference to a CGI script called "script.cgi" 
would look like this: 

<A HREFc 3 "/cgi-birVscriptcgi ,, >CGI</A> 

But with WebPlexer, the cgi root should be omitted: 

<A HREF="scTipLcgT>CGI</A> 

This will ensure correct country and language processing 
of the HTML. 

3.10 WebPlexer Console 

3.10.1 Setting the Console Port 

3.10.2 Setting the Console Password 
3.103 Invoking the Console 

3.10.4 Closing the Console 

3.10.5 Writing Console Templates 

3.10.6 Console Commands 

3.10.6.1 System Configuration 

3.10.6.2 URL Selection 
3.10.63 System Status 

Status Annunciators 
3.10.6.4 Language Status 

Status Annunciators 
3.10.65 Queue Status 

3.10.7 Translation Queues 

3.10.7.1 Enqueue 

3.10.7.2 Dequeue 

3.10.73 Maintaining the Translation Queues 
3.10.7.4 Initiating Off-line Translation Using qtrans 

3.10.8 Document Maintenance 

3.10.8.1 Including Document Types 

3.10.8.2 Including Specific Documents 
3.10.83 Excluding Documents 

3.10.9 Maintaining TermDBs 

3.10.10 Maintaining Input Manager Database Files 
3.10.1 1 Log Analysis 

3.10.1 1.1 Usage utility 
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3.10.11.2 Visits utility 4.1 WP_C0nfig 

wlbpS^^ra^umber of command line flags used WP_Config file conUiiis site-specific information 

for configuration and debuggmg. These are described below. USed t0 xt up and 00nfi g ure W «*Pl«er. 

3.11.1 Configuration Flags 5 WP_Config file is similar in format to a "httpd .conf ' 

-internetport port ^ e ^ consists of a series of key/value pairs, one per line. 

-serverport port Akey is a non-blank string ending with one or more white 

-consoleport port spaces. Examples of keys are InternetPort and ServerPort. 

These flags override the InternetPort, ServerPort, and A value is a parameter string and varies depending upon 

ConsolePort specified in the WP Confie file. me ke ^ Exam P les of values are me strm gs "80" and 

-alogfile " "logs/WPErrorIx,g". 

-cloe file K&ys are separated from values by one or more column 

» fi . delimiters (spaces or tabs). Any text on a line after a 

~ 0g ^ • - s . T t e comment symbol (#) is ignored. 

These flags override the AccessLog, ErrorLog, and Infor- 15 ^ , , t v 7 

mationLog specified in the WP„Cbnfig file. ^ e ke > /values currently defined for WP_Config arc 

gj e listed in the sections below. 

™ . a . n n ... 4.1.1 InternetPort 

This flag specifies a configuration file to use other than the « ( 

default WP_Config. ^ 

-version 20 IntemctPort 

tw *u f w um j where <port>is the port on the Internet side that WebPlexer 

Inis nag displays the current version of WebPlexer and c ^.L . ««_ «r 

libraries monitors lor an http request When WebPlexer is used in 

"production mode" (see Section 3.1.1.2, "Production 
Mode"), this value will be 80, the standard http port. 



-restart 

This flag forces WebPlexer to go through a restart Example 
sequence. 

3.11.2 Debugging Flags 
-v 



InternetPort 80 
4.1.2 ServerPort 
Syntax: 



ServerPort <port> 

"^ e ^ u S 50 where <port>is the port used to communicate with the httpd 

These flags cause WebPlexer to send debugging and server. It should be set to an unused port (e.g. 180) when 
informational messages to stderr. They should not be used in WebPlexer is in "production mode" (see Section 3.1.1.2, 
normal operation as they will cause some amount of per- "Production Mode"). It must match the port specified in the 
formance degradation. Type ./webplexer -h fbr more infor- Web server's configuration file. 
mation - 35 Example: 

~* m< V t „ ServerPort 180 

This flag uses the specified file to activate WebPlexer's 4^3 ConsolePort 
test mode. The test file contains a list of up to 64 domain Syntax* 
suffixes that will be consecutively "spoofed" for each access. ~ , 

For example, if the file contained: 40 ConsolePort <port> 

, , . where <port>is the port used to communicate with the 

u * 1-1 - WebPlexer console, 

the nrst access would appear to come from France, the next Examnle* 

from Germany, and the last from Italy, then the cycle would * 

repeat. Depending on the access pattern, and the contents of ' ~° ™° , 2800 

the document being retrieved (e.g. if there are embedded 45 4.1.4 AccessLog 

graphics), it may be necessary to repeat the same suffix more Syntax- 

than once in the file to achieve the desired result. AccessLog <filename> 

.jjobg where <filename>is the name of WebPlexer's access log file. 

Hiis flag tells WebPlexer to run in the foreground and not lt ^ be relative t0 me We *>Plexer directory or it can be a 

to fork a child process on startup. 50 me Dame * 

-noredirection ^ 

This is a debugging flag and should not be used in normal AccessLog logs/WPAccessLog 

operation. 415 ErrorLog 

-nosigs 55 S y ntax: 

This disables WebPlexer's exception handling. Normally, ErrorLog <filename> 

WebPlexer notifies the client when an exception occurs (if where <filename>is the name of WebPlexer's error log file, 

possible). ft 0311 De re l at i ve to the WebPlexer directory or it can be a 

-noforking ^ s P cci fi ed name. 

This is another debugging flag that forces requests to be 60 Example: 

handled by the parent process. ErrorLog logs/WPErrorLog 

-h, -help 4 *1*> InformationLog 

These flags list and describe the help options. Syntax: 

, _ InformationLog <filename> 

4. Contents of WebPlexer Files ^ . ^, . - „, ,™ , . - 

65 where <filename>is the name of WebPlexer s mformation 

The following sections give detailed specifications of all log file. It can be relative to the WebPlexer directory or it can 

WebPlexer external files. be a fully specified file name. 
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Example: 

InformationLog logs/WPInfoLog 

4.1.7 ServerName 
Syntax: 

ServerName <name> 
where <name>is the DNS entry name of this server. 
Example: 

ServerName wwwJai.com 

4.1.8 RemoteMaintenanceKey 
Syntax: 

RemoteMaintenanceKey <off I on> 

This should be set to "off" in the current release. 

4.1.9 Console 
Syntax: 

Console <off I on> 

Disables/enables the WebPlexer console. 

4.1.10 MasterLanguage 
Syntax: 

MasterLanguage <language> 

Specifies the language to be used as the master or refer- 
ence language for the console (i.e. documents in that lan- 
guage will be used as a reference when determining which 
languages are out of date). 
Example: 

MasterLanguage english 
NOTE: The language must be one of those supported by 
WebPlexer (see Appendix B., "List of Available 
Languages"). 

4.1.11 ConsoleTextExtensions 
Syntax: 

ConsolcTextExtensions <extl ext2 . . . extN> 

The extensions specified as extN indicate what filetypes 

will be under console maintenance functions. 

Example: 

ConsoleTextExtensions .html .htm 

4.1.12 ConsoleFilelnclude 
Syntax: 

ConsoleFilelnclude <filel file2 . . . fileN> 

Specifies individual files to be included under console 

maintenance functions beyond those included using the 

ConsoleTextExtensions directive. 

Example: 
ConsoleFilelnclude faq.txt info.txt 

4.1.13 ConsoleFileExclude 
Syntax: 

ConsoleFileExclude <filel file2 . . . fileN> 

Specifies individual files to be excluded from console 

maintenance among the set that was included using the 

ConsoleTextExtensions directive. 

Example: 

ConsoleFileExclude dummy.html empty.html 

4.1.14 Enqueue 
Syntax: 

Enqueue <directory> 

Specifies the location of the console's enqueue directory. 
NOTE: This directory is relative to the DocumentRoot (see 
below) 
Example: 

Enqueue enq 

4.1.15 Dequeue 
Syntax: 

Dequeue <directory> 
Specifies the location of the console's dequeue directory. 
NOTE: This directory is relative to the DocumentRoot (see 
below) 



Example: 

Dequeue deq 
4.1.16 Plugins 

Pluglns use the WebPlexer API. Information on Plugins is 
5 not available in this release. However, the WP__Config file 
should contain the following two lines: 

Pluglns MLFWI Early 5 

Plugins MLFWO Late 5 
io 4.1.17 DocumentRoot 
Syntax: 

DocumentRoot <path> 
where <path>is the path to the Web server's document root. 
Example: 

15 DocumentRoot /usr/local/etc/httpd/htdocs/ 
4.1.18 cgiRoot 
Syntax: 

cgiRoot <path> 
where <path>is the path to the Web server's cgj root, relative 
to the document root. 

NOTE: <path>should not contain a trailing slash. 
Example: 

cgiRoot /cgi-bin 
25 4.1.19 MCSSIRoot 
Syntax: 

MCSSIRoot <path> 
where <path>is the path to the head of MCSSI directory, 
relative to the document root. 
30 Example: 

MCSSIRoot/webplexer/MCSSI 
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4.2 WP_Map 

The WP__Map file contains a list of top-level URLs that 
are subject to language selection. URLs are organized in 
URL groups, and as many groups as necessary can be 
specified. Each group contains the following information: 

1. The top-level URL of the group. 

2. The list of languages in the group and the names of the 
directories where the documents are located. 

3. The encodings in which the documents are represented 
(e.g. ISO-8859-1) 

45 4. Default directories in case the requested language is not 
found (usually English). 

5. TermDB(s) used when processing files for this group. 

6. Input Manager files (CGI script, templates, output files) 
for any forms contained within this group. 

The format of the file is as shown below. Any text after a 
comment symbol (#) is ignored. 
# URL Group 1 definition 
URL: : <path l><path_type> 
55 URL::<path2><path_type> 



50 URL: : <pathN><path_rype> 

<language>:<language_directoryxcharset> 
<language>:<language_directoryxcharset> 
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default:<default_Janguage><charset># REQUIRED 



50 
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others:<others_languagexcharset># REQUIRED 



outbound:: <TermDBl.l> 
outbound:: '** <TermDB1.2> 

outbound:: <TennDBl.N> 



10 



inbound:: <cgi-scriptl.lxtemplate-filel.lx.prt filel.l> 
inbound:: <cgi-scriptl.2xtemplate-filel.2x.prt filel.2> 

inbound:: <cgi-scriptl.Nxtemplate-filel.Nx.prt filel 15 
.N> 

# URL Group 2 definition 
URL::<pathlxpath_type> 

URL::<path2xpath_type> 20 



URL: : <pathNxpath_type> 

<language> :<language_directoryxcharset> 

<language>:<language_directory><charset> 



default:<default_Janguagexcharset> #REQUIRED 
others:<others_languagexcharset> #REQU1RED 
outbound:: <TennDB2.1> 
outbound:: <TermDB2.2> 

outbound:: '** <TermDB2.N> 



inbound:: <cgi-script2.1xtemplate-file2.1x.prt file2.1> ^ 
inbound:: <cgi-script2^xtemplate-file2.2x.prt file2.2> 

inbound: : <cgi-script2.Nxtemplate-file2.Nx.prt 
file2.N> 

# URL Group N definition *5 



25 
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defaultCountry <country> 

defaultLanguage <language> 
4.2.1 Specifying the Top-Level URL 
Syntax: 

URL::<pathxpath_type> 
where 

<path> 

is the directory path of the top-level URL below the htdoc 
root. 

<path__type> 

is either 'd* or T depending on whether the path is a 
directory or a file. 

Examples of valid paths and path_types are: 

/d 

/userl d 
/userl/docs d 
Aiserl/docs/info.html f 
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You can specify as many paths as needed within a URL 
group. 

A directory specification only matches a request for that 
directory. It will not match any requests for files in that 
directory. 

For example, 

URL::/userl d 
would match 

GET http://www.lai.com/userl 

GET http://www.lai.com/userl/ 
but would NOT match 

GET http://www.lai.com/userl/info.htnil 

GET http://www.lai.com/userl/index.html 

These would require additional path specifications 

URL::/userl/info.html f 

URL::/userl/index.html f 
4.2.2 Specifying Language/Directory/Charset 
Syntax: 

<language>: <language_directoryxcharset> 
where 

<language> 
is the name of the language. 

<language directory> 

is the name of the subdirectory containing documents of that 
language. 

<charset> 

is the encoding used for documents in that language (e.g. 
ISO-8859-1) 

All the URLs in a group share a common language-to- 
directory map. 

NOTE: <language>must be one of the available languages 
listed in Appendix B., "List of Available Languages." 

Examples of valid language to directory entries are: 

fircnch: french ISO-8859-1 

english: eng_docs ISO-8859-1 

Catalan: spanish__dir ISO-8859-1 

Japanese: japanesc ISO-2022-JP 

There are two reserved <language>names that have spe- 
cial meaning. One is "default" and the other is "others". 
Both of these must be specified in the directory map of each 
URL group definition. They are described in more detail 
below. 

4.2.2.1 others: 
Syntax: 

others: <others_languagexcharset> 

The "others" language is used when the requested lan- 
guage is valid but does not exist on the server. 

For example, if English, French, and Spanish documents 
exist on the server, but the requested language is Russian, 
WebPlexer will use the language defined as "others". 

It is usually the case that "others'* is set to English. 
Example: 

others: english ISO-8859-1 

4.2.2.2 default: 
Syntax: 

default: <default_Janguagexcharset> 

The "default" language is used when WebPlexer cannot 
determine the language of a request and none has been 
specified by the browser. This can happen when the country 
of origin cannot be determined because the hostname is 
unavailable or when the requester is behind a firewall. 

It is usually the case that "default" is set to English. 
Example: 

default: english ISO-8859-1 
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4.2.3 outbound:: Enabling the Output Manager Example: 

The output manager (i.e. Multilingual ToolKit) is enabled defaultCountry United-States 

by specifying TermDB(s) in the WP_Map file for a given 4.2.5.2 defaultLanguagc 

top-level URL. Syntax: 

All documents containing templates, tags, or other 5 defaultLanguage <language> 

resources of the Multilingual ToolKit which require where 

dynamic processing will be processed through the listed <language> 

TermDBs for this URL group fe me kngMW to be used as WebPlexer's global default 

Syntax: language. 

. m ~ r NOTE: The language must be a valid language from Appen- 

outbound:: <TermDBN> 10 dk ^ ^ Jf*^ LangWfc - ^ 

wnerc Example: 

<TermDBN> defaultLanguage english 

is the name of a TermDB to be applied for this top-level aika rw 

tjj^l 4.3 Map Overrides 

NOTE: The TermDB should be in the main WebPlexer 15 WebPlexer provides two files that can optionally force the 

directory. selection of specific language(s) for certain hostnames or IP 

Examples: addresses when the top-level URL is accessed. 

outbound:: 'master termdb.html - ™f "* HostNamcMa P and IPAddrMap, and are 

— described below, 

outbound:: ^*i^r temdb.html 20 Both mes are ^ WebPlexer's language subdirectory. 

4.2.4 inbound:: Enabling the Input Manager 4.3.1 HostName Map 
To use the input manager, the name of the CGI script to Syntax: 

be intercepted is specified in the WP_Map file along with <hostIDlxlanguage_setl> 

the location of a template file and an output (.pit) file for this <hostID2><lancuace — set2> 

URL group. 25 u s s — 

A separate inbound statement is required for each CGI 
script. 

NOTE: Each inbound statement must specify a different .pit * . Am „ , 

r j r <hostIDNxlanguage_setN> 

where 

The template file tells the input manager which fields of 30 <hostID> 

the form should be extracted. When the CGI script is ^ a h ostname 

invoked, the input data from those fields is saved and , 

appended to the output, or .prt (pre-translated) file. This file • < n ?? a ^l? C 4 > * 1 c* L r 

ate contains the time stam£ current country and language, ^ ordered Lst of one or more languages of the form 

and domain of the requester. 35 1" • • • ; <tanguageN>} 

q . ^ J3 and languageN is preferred over languageN+1 for this 

J> y nlax: hosUD. 

inbound:: <cgi-scriptN><template-fileNx.prt fileN> NOTE: The HostName Map file must be in WebPlexer's 

where 'language' subdirectory. 

<cgi-scriptN> Examples: 

is the name of the CGI script to be intercepted. 40 wwwJai.com {english} 

<template-fileN> www.generichost.com {default} 

is the name of the file defining the fields to be captured. 4 3 2 IPAddress Ma P 

Please see Section 3.8.1, "Template Definition," for a defi- Syntax: 

nition of this file. <ipH)lxlanguage_setl> 

<.prt fileN> 45 <ipn>2xlanguage_set2> 
is the name of the output (pre-translated) file. Please see 
Section 3.8.2, **.prt File Description," for a definition of this 
file. 

Examples: <ipIDN><language_setN> 

50 where 

inbound:: form.cgi form.tmpl form .prt <ipID> 

inbound:: contact.cgi contact.tmpl contact.prt an jp address 

Also see the sample WP_Map file included with this <language^set> 

release. is an ordered list of one or more languages of the form 

4.2.5 Setting Global Defaults ss { < i anglU gpi >( <language2>, . . . , <languageN>} 
Global defaults are declared at the end of the WP_Map and languageN is preferred over languageN+1 for this ipID. 

file and apply to all URL groups. NOTE: The IPAddress Map file must be in WebPlexer's 

4.2.5.1 defaultCountry 'language* subdirectory. 

Syntax: Example: 

defaultCountry <country> 60 192.42.172.9 {farsi, russian, english} 

where 4.4 Log Specifications 

<country> WebPlexer supports four log file types: 

is the country to be used as WebPlexer's global default AccessLog 

country. 65 ErrorLog 

NOTE: The country must be a valid country from the InformationLog 

country.list file. UsageLog 
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The filenames corresponding to these log file types, 
except for the UsageLog, are defined in the WP_Config file 
(See Section 4.1.4, "AccessLog," Section 4.1.6, 
"InformationLog," and Section 4.1.5, "ErrorLog," for addi- 
tional information). The UsageLog is created and main- 5 
tained by the usage script, described in Section 3.10.1 1.1, 
"Usage utility." 

4.4.1 AccessLog 

The AccessLog records information about each request 
received by WebPlexer, such as date, time, and request type. 10 

Each AccessLog entry conforms to the Common Logfile 
Format: 

<remotehostxrfc93 1 ><authuserx[date ] 
x"request"><statusxbytes> 
where the fields are defined as follows 15 
<remotehost> 

Remote hostname (or IP address if DNS hostname is not 
available). 
<rfc931> 

The remote logname of the user. 20 
<authuser> 

The username by which the user has authenticated him- 
self. 
<fdate]> 

Date and time of the request. 
<"request"> 

The request line exactly as it came from the client. 
<status> 

The HTTP status code returned to the client. 30 
<bytes> 

The content-length of the document transferred. 
Examples: 

pp93.satcom.net— {28/Jul/1996:23:37:23 -0700] "GET/ 
cnv.post/infb.html HTTP/1.0" 3Q2-pp93.satcom.net— 35 
[28/lul/1996:23:37:24 -0700] "GET/cnv.post/english- 
american_United-States/info.html HTTP/1.0" 200 
3372 

pp93.satcom. net— [28/Jul/1996:23:37:29 -0700] "GET/ 
cov.post/coming/flags/United-States.gif HTTP/1.0" 40 
200 3594 

NOTE: When WebPlexer is configured for "production 
mode" (sec Section 3.1.1.2, "Production Mode"), WebPlex- 
er's AccessLog should be used in place of the Web server's 
access log for auditing purposes. The Web server's access 
log will show all requests originating from WebPlexer. 

4.4.2 ErrorLog 
The ErrorLog logs any errors that occurred during Web- 
Plexer operation, or during the processing of a request. 

The format of an ErrorLog entry is: 
[date] <Error-Msg> 
Examples: 

[01/Aug/1996:21:Q2:23 -0700] Couldn't bind to socket 
for internet port 3080 55 

4.4.3 Information Log 

The InformationLog records information about Web- 
Plexer language and country selection. An entry is written 
whenever language or country changes, whether automati- 
cally or by manual selection. 60 
The format of an InformationLog entry is: 
<remotehostxrfc931xauthuser><[date]><status- 
messago 

where the fields are defined as follows 

<remotehost> 65 
Remote hostname (or IP address if DNS hostname is not 

available). 
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<rfc931> 

The remote logname of the user. 
<authuser> 

The username by which the user has authenticated him- 
self. 
<[date]> 

Date and time of the request. 
<status-message> 

Status information pertaining to r country or language 
processing. 

Examples of InformationLog entries are: 

pcll3.fiz-karlsruhe.de— {01/Aug/l 996:01:55: 13 -0700] 
Country Selection: 

pcll3.fiz-karlsnihe.de— >Germanyr 

pcll3.fiz-karlsruhe.de— [01/Aug/1996:01:55: 13 -0700] 
Language Set: 

pcll3.fiz-karlsrurie.de-> {german} 

pcll3.fiz-karlsruhe.de— [01/Aug/i 996:01:55: 13 -0700] 
Language Selection: 

pcll3.fiz-karlsruhe.de— >German ; 
4.4.4 Usage Log 

The UsageLog contains a summary of country and lan- 
guage statistics. It is generated from the InformationLog 
using the usage log analysis tool. The UsageLog can be 
further processed using the visits log analysis tool. Please 
see Section 3.10.11, "Log Analysis," for a description of 
WebPlexer* s log analysis tools. 

The format of a UsageLog entry is: 

<[date]x TM Language:J ! | TM Country: /xlanguagel 
countryxremotehost> 
where the fields are defined as follows 

<[date]> 

Date and time of the request. 
<language> 

New language of this request (i.e. an entry is created when 
a visitor enters the site or changes language once on the site). 
<country> 

New country of this request (i.e. an entry is created when 
a visitor enters the site or changes country once on the site). 
Examples of UsageLog entries are: 
[17/Oct/1996:13:40:ll -0700] Language: German 

www07.btx.dtag.de 
[17/Oct/1996:14:09:57 -0700] Country: Argentina 

serverl .sminter.com. ar 

4.5 CountryMap 

The CountryMap contains a database of the world's 
languages and dialects. It is not human-readable. 
4.5.1 Creating a User-defined CountryMap 

4.6 country.list 

The country.list file contains a list of available countries. 
It is identical to the list included in Appendix C, "List of 
Available Countries." 

4.7 WP_LangCode 

The WP_XangCode file contains the iso639 language 
map. 

4.8 WP_Languages 

The WP_JLanguages file contains a list of available 
languages and dialects. It is identical to the list included in 
Appendix B., "List of Available Languages." 
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4.9 WP_Regions 

The WP Regions file consists of a list of names of 

user-defined regions, each one followed by a list of countries 
contained within that region. 
Syntax: 

# this is user-defined regionl 

<user-defined_j , egionl>: : 

<countryl> 

<country2> 
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<countryN> 

# this is user-defined region2 
<user-defined_region2>: : 
<countryl> 
<country2> 
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<countryM> 

Blank lines as well as lines beginning with '#' are ignored. 
Tabs at the beginning of a line are also ignored. 
Example: 
sales__region 1:: 
Japan 
China 
Hong-Kong 
sales_jegion2: : 
France 
Great-Britain 
Germany 

Countries must be chosen from the list of valid countries 
for WebPlexer. See Appendix C, "List of Available Coun- 
tries.** 

The WP__Regions file contains only user-defined regions. 
WebPlexer also maintains a set of pre-defined (default) 
regions. These are listed in Appendix D., "list of WebPlexer 
Default Regions." 

"Determining the country or region_directory," fur- 
ther describes the interaction and relationship between pre- 
defined (default) and user-defined regions. 

4.10 Master__termdb.html 

The Master_termdb is included with the product and 
contains translations of the names of the world's countries 
and languages in the following languages and encodings: 
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Language 


Encoding 


english 


ISO-8859-1 


english-amciican 


ISO-8859-1 


english-british 


ISO-8859-1 


arabic 


CP1256 


portugucse-biazilian 


ISO-8859-1 


Chinese-simplified 


GB2312 


Chinese-traditional 


BIG5 


dutch 


ISO-8859-1 


flnnish 


ISO-8859-1 


french 


ISO-8859-1 


french-canadian 


ISO-8859-1 


french-continental 


ISO-8859-1 
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tan on » at* 
language 


UlILUUlIlU 


gennan 


ISO-8859-1 


greek 


CP1253 


hebrew 


CP 125 5 


Indonesian 


ISO-8859-1 


Italian 


ISO-8859-1 


Japanese 


x-euc-jp 


kinyarwanda 


ISO-8859-1 


kirundi 


ISO-8859-1 


korean 


euc-kr 


Portuguese -continental 


ISO-8859-1 


russian 


CP1251 


samoan 


ISO-8859-1 


Spanish 


ISO-8859-1 


spanish-castOian 


ISO-8859-1 


spanish-latin-american 


ISO-8859-1 


Swedish 


ISO-8859-1 


thai 


CP874 


Vietnamese 


VIASQI 



Although the invention is described herein with reference 
to the preferred embodiment, one skilled in the art will 
readily appreciate that other applications may be substituted 
for those set forth herein without departing from the spirit 
and scope of the present invention. Accordingly, the inven- 
tion should only be limited by the claims included below. 
What is claimed is: 

1. A process for managing, tracking, accounting and 
3Q translating multilingual electronic content in a computer 

environment, comprising the steps of: 

detecting when a document, data stream, or non-text file 

in a master language has been updated; 
notifying the user which corresponding documents, data 
streams, or non-text files in the other languages require 
translation; 

allowing the user to initiate the translation of a document, 
data stream, or non-text file and its constituent or 
dependent elements; 
converting said document, data stream, or non-text file 
and its constituent or dependent elements requiring 
translation to an internal format; 
staging the translation of said document, data stream, or 
non-text file and its constituent or dependent elements; 
and 

dynamically routing and sequencing said document, data 
stream, or non-text file and its constituent or dependent 
elements to the appropriate translation resources, 
wherein said routing and sequencing is performed 
according to any of: the subject matter of the document 
to be processed, target language of the translation, and 
whether draft-only or high quality is required. 

2. The process of claim 1, further comprising the steps of: 
analyzing all of the project components in sufficient detail 

to determine project cost and resource requirements; 
generating a project cost estimate prior to beginning the 
project; 

creating Work Packets, Job Tickets and a Project Schedule 
for said project; and 

sending said Project Schedule to the user for review and 
validation. 

3. The process of claim 2, further comprising the step of: 
adjusting the Project Schedule when there are changes in 

the availability of translation resources, or when 
manual changes are made to the project by a user. 
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4. The process of claim 1, further comprising the steps of: 
providing management status and reporting as tbe trans- 
lation process progresses to the user; 

automatically notifying the user of the completion of 
translation; 

updating customer and translation resource accounting at 
said translation completion; and 

coordinating the delivery of the translated documents, 
data streams, or non-text files and their constituent or 
dependent elements back to the user for installation into 
a Web site, database, or file system or for optional 
review. 

5. A process for managing, tracking, accounting and 
translating multilingual electronic content in a computer 
environment, comprising the steps of: 

detecting when a document, data stream, or non-text file 
in a master language has been updated; 

notifying the user which corresponding documents, data 
streams, or non-text files in the other languages require 
translation; 

adding new language content to a Web site, database, or 
file system, wherein the user selects said new language 
from a pulldown list; 

automatically initiating the translation of a document, 
data stream, or non-text file into the desired language; 

converting said document, data stream, or non-text file 
requiring translation to a suitable internal format; 

staging the translation of said document, data stream, or 
non-text file; and 

dynamically routing and sequencing said document, data 
stream, or non-text file to the appropriate translation 
resources, wherein said routing and sequencing is per- 
formed according to the subject matter of the document 
to be processed, target language of the translation, and 
whether draft-only or high quality is required. 

6. The process of claim 5, further comprising the steps of: 
tracking the versions of a translated document, data 

stream, or non-text file, wherein said tracking step 
allows the easy integration with a third-party authoring 
or document management system; and 
controlling access to a document, data stream, or non-text 
file and its constituent or dependent elements being 
translated. 

7. A process for managing, tracking, accounting and 
translating multilingual electronic content in a computer 
environment, comprising the steps of: 

detecting when a document, data stream, or non-text file 

in a master language has been updated; 
notifying the user which corresponding documents, data 

streams, or non-text files in the other languages require 

translation; 

sending documents, data streams, or non-text files to be 
translated down a pipeline, wherein said pipeline is 
connected to a plurality of translation resources using 
one or more open Application Prograrnming Interfaces 
(API); 

sending packets to translation resources through said 
Adaptor, wherein work packets are converted to the 
appropriate translation resource format before sending 
said converted work packet to said translation resource; 

receiving packets from translation resources through said 
Adaptor, wherein received packets are converted back 
to the work packet format and the status and control 
information are updated in said work packet before 
sending said converted received packet to said pipeline; 
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wherein said one or more open API allow a variety of 
translation resources to be connected to said pipeline; 
and 

wherein said translation resources perform translation or 
other linguistic functions on a set of input documents 
and produce a set of output documents that have been 
transformed linguistically. 

8. The process of claim 7, wherein said translation 
resources are deployed across intranets as well as the Inter- 
net. 

9. An apparatus for managing, tracking, accounting and 
translating multilingual electronic content in a computer 
environment, comprising: 

a module for detecting when a document, data stream, or 
non-text file in a master language has been updated; 

a module for notifying the user which corresponding 
documents, data streams, or non-text files in the other 
languages require translation; 

a module for allowing the user to initiate the translation of 
a document, data stream, or non-text file and its con- 
stituent or dependent elements; 

a module for converting said document, data stream, or 
non-text file and its constituent or dependent elements 
requiring translation to an internal format; 

a module for staging the translation of said document, 
data stream, or non-text file and its constituent or 
dependent elements; and 

a module for dynamically routing and sequencing said 
document, data stream, or non-text file and its constitu- 
ent or dependent elements to the appropriate translation 
resources, wherein said routing and sequencing is per- 
formed according to any of: the subject matter of the 
document to be processed, target language of the 
translation, and whether draft-only or high quality is 
required. 

10. The apparatus of claim 9, further comprising: 

a module for analyzing all of the project components in 
sufficient detail to determine project cost and resource 
requirements; 

a module for generating a project cost estimate prior to 

beginning the project; 
a module for creating Work Packets, Job Tickets and a 

Project Schedule for said project; and 
a module for sending said Project Schedule to the user for 

review and validation. 

11. The apparatus of claim 10, further comprising: 

a module for adjusting the Project Schedule when there 
are changes in the availability of translation resources, 
or when manual changes are made to the project by a 
user. 

12. The apparatus of claim 9, further comprising: 

a module for providing management status and reporting 
as the translation process progresses to the user; 

a module for automatically notifying the user of the 
completion of translation; 

a module for updating customer and translation resource 
accounting at said translation completion; and 

a module for coordinating the delivery of the translated 
documents, data streams, or non-text files and their 
constituent or dependent elements back to the user for 
installation into a Web site, database, or file system or 
for optional review. 

13. The apparatus of claim 9, further comprising: 

a module for tracking the versions of a translated 
document, data stream, or non-text file, wherein said 
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tracking step allows the easy integration with a third- 
party authoring or document management system; and 
a module for controlling access to a document, data 
stream, or non-text file and its constituent or dependent 
elements being translated. 

14. An apparatus for managing, tracking, accounting and 
translating multilingual electronic content in a computer 
environment, comprising: 

a module for detecting when a document, data stream, or 

non-text file in a master language has been updated; 
a module for notifying the user which corresponding 

documents, data streams, or non-text files in the other 

languages require translation; 
a module for adding new language content to a Web site, 

database, or file system, wherein the user selects said 

new language from a pulldown list; 
a module for automatically initiating the translation of a 

document, data stream, or non-text file into the desired 

language; 

a module for converting said document, data stream, or 
non-text file requiring translation to a suitable internal 
format; 

a module for staging the translation of said document, 
data stream, or non-text file; and 

a module for dynamically routing and sequencing said 
document, data stream, or non-text file to the appro- 
priate translation resources, wherein said routing and 
sequencing is performed according to the subject mat- 
ter of the document to be processed, target language of 30 
the translation, and whether draft-only or high quality 
is required. 

15. An apparatus for managing, tracking, accounting and 
translating multilingual electronic content in a computer 
environment, comprising: 35 

a module for detecting when a document, data stream, or 
non-text file in a master language has been updated; 

a module for notifying the user which corresponding 
documents, data streams, or non-text files in the other 
languages require translation; 

a module for sending documents, data streams, or non- 
text files to be translated down a pipeline, wherein said 
pipeline is connected to a plurality of translation 
resources using one or more open Application Pro- 
gramming Interfaces (API); 

a module for sending packets to translation resources 
through said Adaptor, wherein work packets are con- 
verted to the appropriate translation resource format 
before sending said converted work packet to said 
translation resource; 

a module for receiving packets from translation resources 
through said Adaptor, wherein received packets are 
converted back to the work packet format and the status 
and control information are updated in said work 
packet before sending said converted received packet to 
said pipeline; 

wherein said one or more open API allow a variety of 
translation resources to be connected to said pipeline; 
and 

wherein said translation resources perform translation or 
other linguistic functions on a set of input documents 
and produce a set of output documents that have been 
transformed linguistically. 

16. The apparatus of claim 15, wherein said translation 
resources are deployed across intranets as well as the Inter- 
net. 
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17. A program storage medium readable by a computer, 
tangibly embodying a program of instructions executable by 
the computer to perform method steps for managing, 
tracking, accounting and translating multilingual electronic 
content in a computer environment, comprising the steps of: 

detecting when a document, data stream, or non-text file 
in the master language has been updated; 

notifying the user which corresponding documents, data 
streams, or non-text files in the other languages require 
translation; 

allowing the user to initiate the translation of a document, 

data stream, or non-text file and its constituent or 

dependent elements; 
converting said document, data stream, or non-text file 

and its constituent or dependent elements requiring 

translation to an internal format; 
staging the translation of said document, data stream, or 

non-text file and its constituent or dependent elements; 

and 

dynamically routing and sequencing said document, data 
stream, or non-text file and its constituent or dependent 
elements to the appropriate translation resources, 
wherein said routing and sequencing is performed 
according to any of: the subject matter of the document 
to be processed, target language of the translation, and 
whether draft-only or high quality is required. 

18. The method of claim 17, further comprising the steps 
of: 

analyzing all of the project components in sufficient detail 
to determine project cost and resource requirements; 

generating a project cost estimate prior to beginning the 
project; 

creating Work Packets, Job Tickets and a Project Schedule 

for said project; and 
sending said Project Schedule to the user for review and 

validation. 

19. The method of claim 18, further comprising the step 
of: 

adjusting the Project Schedule when there are changes in 
the availability of translation resources, or when 
manual changes are made to the project by a user. 

20. The method of claim 17, further comprising the steps 
of: 

providing management status and reporting as the trans- 
lation process progresses to the user; 

automatically notifying the user of the completion of 
translation; 

updating customer and translation resource accounting at 
said translation completion; and 

coordinating the delivery of the translated documents, 
data streams, or non-text files and their constituent or 
dependent elements back to the user for installation into 
a Web site, database, or file system or for optional 
review. 

21. The method of claim 17, further comprising the steps 
of: 

tracking the versions of a translated document, data 
stream, or non-text file, wherein said tracking step 
allows the easy integration with a third-party authoring 
or document management system; and 

controlling access to a document, data stream, or non-text 
file and its constituent or dependent elements being 
translated. 

22. A program storage medium readable by a computer, 
tangibly embodying a program of instructions executable by 
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the computer to perform method steps for managing, 
tracking, accounting and translating multilingual electronic 
content in a computer environment, comprising the steps of: 
detecting when a document, data stream, or non-text file 

in the master language has been updated; 
notifying the user which corresponding documents, data 
streams, or non-text files in the other languages require 
translation; 

adding new language content to a Web site, database, or 
file system, wherein the user selects said new language 
from a pulldown list; 

automatically initiating the translation of a document, 
data stream, or non-text file into the desired language; 

converting said document, data stream, or non-text file 
requiring translation to a suitable internal format; 

staging the translation of said document, data stream, or 
non-text file; and 

dynamically routing and sequencing said document, data 
stream, or non-text file to the appropriate translation 
resources, wherein said routing and sequencing is per- 
formed according to the subject matter of the document 
to be processed, target language of the translation, and 
whether draft-only or high quality is required. 

23. A program storage medium readable by a computer, 
tangibly embodying a program of instructions executable by 
the computer to perform method steps for managing, 
tracking, accounting and translating multilingual electronic 
content in a computer environment, comprising the steps of: 

detecting when a document, data stream, or non-text file 
in the master language has been updated; 
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notifying the user which corresponding documents, data 
streams, or non-text files in-the other languages require 
translation; 

sending documents, data streams, or non-text files to be 
translated down a pipeline, wherein said pipeline is 
connected to a plurality of translation resources using 
one or more open Application Programming Interfaces 
(API); 

sending packets to translation resources through said 
Adaptor, wherein work packets are converted to the 
appropriate translation resource format before sending 
said converted work packet to said translation resource; 

receiving packets from translation resources through said 
Adaptor, wherein received packets are converted back 
to the work packet format and the status and control 
information are updated in said work packet before 
sending said converted received packet to said pipeline; 
wherein said one or more open API allow a variety of 
translation resources to be connected to said pipeline; 
and 

wherein said translation resources perform translation or 
other linguistic functions on a set of input documents 
and produce a set of output documents that have been 
transformed linguistically. 
24. The method of claim 23, wherein said translation 
resources are deployed across intranets as well as the Inter- 
net. 
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